ON THE INVARIANT SYSTEM OF A PAIR OF CONNEXES* 
BY 


O. E. GLENN 


The theoretical part of this paper treats of a transvectant operation between 
two general connexes 


= *** Arz Aiy Agu *** Asu, 


Bbyz bez Biu Bou ao Beus 


wherein the variables are ternary and (wu) contragredient to (x). This 
operation, defined by polarization, gives all invariant formations of the simul- 
taneous system of two connexes. Our transvectant, of four indices, 
T= 

can be interpreted in terms of convolution operations employed by Gordan 
in his memoirs on the systems of the ternary cubic and quartic forms,t and 
in Maisano’s papert on the concomitants of the first five degrees of the ternary 
quartic. For doubly homogeneous quantics which are in normal form§ 
(§$ 1, (4)) the Formenrethe employed by E. Noether|| includes an ordered suc- 
cession of terms of transvectants 7. As compared with the methods of the 
latter author, who dealt with normal ground-forms and therefore with a 
canonical form of the problem, our algorithm is non-canonical, and therefore 
symmetrical. The present method bears a closer resemblance to that of 
Clebsch and Gordan] than to subsequently developed methods. 

However, as preliminary to the application of the four types of convolution 
which were employed by all of the before-mentioned authors, we apply to 
one of the forms ¢, y to be combined, a single analytic operation of polariza- 
tion, involving 7i-fold operative factors corresponding to the first type of © 

* Presented to the Society, February 27, 1915. 

+Gordan, Mathematische Annalen, vol. 1 (1869), p. 90, and vol. 17 (1880), 
p. 217. 

t Maisano, Giornali di Battaglini, vol. 19. 

§ Gordan, Mathematische Annalen, vol. 5 (1873), p. 104, and Programmschrift 
(Leipsic, 1875). 

|| E. Noether, Ueber die Bildung des Formensystems der terndren biquadratischen Form, 
Journal fir Mathematik, vol. 134 (1908). 

{| Ueber biternére Formen mit contragredienten Variabeln, Mathematische An- 
nalen, vol. 1 (1869), p. 359. 
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convolution, j-fold operators corresponding to the second type, and k- and 
l-fold operators corresponding respectively to the third and fourth types 
of convolution. The substitutions (1) made in this polar giver. This secures 
an ordering of the convolutions corresponding to a given set of indices 7, j, 
k,l, s0 that theorems analogous to those on the difference between two terms 
of a binary transvectant are readily stated and proved fort. The preliminary 
operation of polarization was not employed by Clebsch and Gordan, although 
they, and likewise E. Noether (loc. cit.), employed the four types of convolution 
(under different schemes of ordering) to generate the complete systems which 
they treated.* 

Where it is possible to proceed by applying convolutions between two sys- 
tems which are finite and complete, absolutely or with respect to a set of 
symbolical moduli, 7 yields all invariant formations (§1). Thus if we are 
given the finite complete system of each connex ¢, ¥, then the system derived 
by transvection between these two systems is itself finite and complete. 

In the second part of the paper we employ 7 in deriving a complete system of 
simultaneous concomitants of a quadric p? and a bilinear connex a; a, (§ 2). 
The problem of making reductions within the finite system generated by trans- 
vection in this case is treated from the two standpoints of reduction by iden- 
tities, and by the syzygies connecting the ground forms. It is the author’s 
thought that the present methods present possibilities for extension towards 
a theory for the general orders when an adequate asyzygetic theory of ternary 
forms is available. 

We give below a list of memoirs on the various known methods of con- 
structing ternary systems. 


1. TRANSVECTANT SYSTEMS 


Let ¢, wy be the two general connexes given above. We define the trans- 
vectant 7, of four indices, as follows: Polarize ¢ by the following operator, 


0 \" \" 0 \‘e \" \‘e 
=) ( =) @ x) x) (x; x) 
71 0 v2 0 0 v1 fs] ve 


where 
x) 


* The introduction of normal ground-forms enabled E. Noether to reduce the number of 
processes of convolution to two essential types. 

t Baker, Cambridge Philosophical Transactions, vol. 15 (1889). 
Forsyth, American Journal of Mathematics, vol. 12 (1890). White, 
American Journal of Mathematics, vol. 14 (1892). Mertens, Wiener 
Berichte, vol. 95. 
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etc., and vy; = Oor 1, and 
Le=t, 
t+jsr, k+1lss, 
Substitute in the resulting polar (¢) 

(a) 

(b) yf = (bpu) 

(c) = by p), 

(d) = (Bp) 0), 


and multiply each term of the result by the b,, 8, factors whose symbols are 
not then involved in it. The resulting concomitant we call the transvectant 
of ¢ and y of index (j'4) and abbreviate it as 


T= 
We abbreviate the above polar as 
(¢) = DA A,o A. Ao 


Let ¢,, tg be any two terms of the double mixed polar (¢). These can 


written, respectively, 
Ui, te = U2, 


where y; (t = 1, 2) are the products of the az, a, factors of t; (i = 1, 2), 
and U; (i = 1, 2) are the products of the a,, a, factors of t; (i = 1, 2), 
respectively. Leaving out of account the constant factor A, (¢) is an ex- 
pression of the type 


wherein 


Ay» Ay® A. Av 


*** 


Ly= LU = 


The first theorem to be proved depends upon the difference t; — t2 of two 
terms of (¢). We have 


ty — te = yi( U1 — U2) + U2(11 — 


and in this each parenthesis is the difference between two terms of a simple 
mixed polar. Now two terms of a simple polar are said to be adjacent when 
they differ only in that a pair of factors a, a;z in one is replaced in the other by 
Qy2z4jy. By a known theorem we can add and subtract terms within each 


407 
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parenthesis so that we will have 

in: + — 97) + + — 

— + — + + — Ua), 


U, — U2 


bo 
— 


where each bracket is a difference between two adjacent terms of a simple 
mixed polar. 

Definition. The sum of the order of a form in the variables (2) plus its 
order in the variables (w) is called its grade. 

Definition. The process by which we obtain from @ = +++ Q1y 
any one of the three covariant types 


= (a, a2 Arz Alu *** 


(ay A1z Arz *** Asus 


II 


C3 = *** Arz *** Assy, 


is called convolution. 
THeoreM 1. The difference between any two terms t,, tz, of a transvectant 


T= 


equals a sum of terms each of which is a term of a transvectant 7 of two forms ¢, 
y, at least one of which is of lower grade than the corresponding one int. More- 
over, are obtained from respectively by convolution. 

To prove this theorem observe that the differences involved in (2) are 
essentially of ten types, and except for factors which may be left out of account 
in pursuing the argument these are 


(A) der — Any? Anz, (D) — Any”, 
(B) — dey any, (E) diy? — Oey? Anz, 
(C) dey? — 


Substitution from (1) in these differences gives all types of expressions which 
occur in the difference between two terms of the transvectant r. The results 
of the substitutions are given below, (A) going into (A,), (B) into (B,), 


etc. 


(A1) — Akg, Anz = (Ana (Bp@)), 


(Bi) aig, dep, — = (Gn (BpBq)), 
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Ong, (A — Ag, (Ar bau) = (a, bg) Bou — (ay, U) 
(Di) (anbpu) (ax — (ae bp u)(anbgu) = (a,a,u) (bp bgu), 
(E,) (ap, by U) Ake ( bp U) Anz = (ap ak u) byz —_ (ap b, ) ts. 
The five expressions (Bi), (11), (4:), (E1) are obtainable from the 
respective expressions (A), (Bi), (C1), (Di), (41) by the substitutions 


a, bp bg Bp Ba 
The right-hand sides of these expressions represent terms of transvectants as 


follows; the particular convolution from @¢ in which the factor a;z az is re- 
placed by (a, a, u) being indicated by (a, a, wu)’, and so forth: 


(Ai) ( (ap u)d¢’, 

(Bi) ( (a, u)¢’, (Bp Bax) 

(Di) (bp bgu)y er’, 

(Ei) ((anaeu)d’, and us. 
Inasmuch as (Ai), ---, (E;) arise from (A,), ---, by (8), the prin- 
ciple of symmetry in the theory now immediately shows that the theorem is 
true in all cases, since (a, a, u)@’ is of lower grade than @ and (a, a 2) yp’ 
is of lower grade than y. 

An immediate corollary is that any monomial concomitant x equals the 
transvectant 7 of which it is a term plus terms of transvectants of forms one 
at least of which is a convolution from the corresponding one in 7 and hence 
of lower grade, times powers of uz. 

Evidently a successive application of this corollary gives for x an expansion 
in the form 


(4) K = Wo + Wi Ur + + 


Gordan and also Mertens derived a similar result and showed that the w; 
were normal forms, i. e. they satisfy the equation 
Ou, | Ox. OU. 


THEOREM 2. Let = A? Bi, = C2 D2, and let be the transvectant 
= (¢, 
Then if it be possible to separate o, W into factors, 
= = Aj, By: Ay 
= CE DI, 


lu 
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and to partition the index numbers, 
t=Uth, k=k+he, l=i+h, 
in such a way that the transvectants 
exist as separate nonsymbolical entities, then 7 contains reducible terms, namely, 


terms of the product 7, T2. 
The transvectant 7 is evaluated by polarizing ¢ by means of the operator 


(ond) (8) 
x x u 
and substituting 
yM=D,- y® = (Cu), M=_C, = (Dr). 


Thus 7 contains the term 


(vx) An( (wz) AZ 
\" 0 \* \" 
(2) (1) (2) 
x =) (« x) =) Br 


= Dy, yP=D., yP=(C2u), 
= Cj, = (D2), Cs, = 
This term is a constant times 
7172 = (dr, Wr (be, 


Let [¢] represent a system of connexes given explicitly by 


di = ati 
and let [y] represent the system 
Yi = BY. (¢=1,2,---). 


Then a system [7] is said to be the system derived from [¢], [Wy] by trans- 
vection when it includes all terms of all transvectants, 


T= (U, V 


where U is a product of powers of forms from the system [¢] and V a product 
of powers of forms of the system [y]. 

THEOREM 3. The number of transvectants of the system [1] which contain no 
reducible terms is finite.* 


* Gordan proved the finiteness of simultaneous systems in 1875. 
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In proof let 


Then the order of U in the p = p; + «++ + pa distinct a symbols is 
Pi @1 + + +++ + Dawa, 

and the order of U in the p distinct a symbols is 
+ Powe + +++ + 


The total effect of the transvection r upon U, V is to produce a sequence of 
terms by folding 7 8 symbols and j b symbols into convolution combinations with 
a symbols, and by folding kb symbols and 18 symbols into convolution com- 
binations with a symbols. In one of these terms ¢; of 7 let the number of a 
symbols not in convolution combination with other symbols be p;. Likewise 
let the number of a symbols of t; not so combined be p2, the number of b 
symbols not combined be p3, and the number of 8 symbols not combined with 
other symbols be ps. Then we have © 


Pi + Powe + +++ + = pi tit], 
(5) + Powe + Pata = tk+l, 


+ G2 V2 = pa titl. 


We thus have a system of four linear diophantine equations in the variables 
Piy ***» Pay Ply ***y k,l, to be satisfied in positive in- 
tegers. Each reducible solution of this system corresponds to a transvectant 
7 having reducible terms, by the theorem last proved. But the number of 
irreducible solutions of such a system is finite by a well-known theorem. 
Hence the theorem is proved. 

Definition. A system of connexes containing an infinite number of individ- 
uals is said to be finite when there exists a finite set of the connexes such that 
every connex of the system is a rational integral function of the members of 
the set. 

Definition. A system is complete when any form obtained from a product 
of connexes of the system, by convolution, is expressible rationally and in- 
tegrally in terms of connexes of the system. 

A proof of a theorem now follows readily from the properties of transvec- 
tants established above, giving a method of constructing the finite system of a 
pair of connexes. 

THEOREM 4. [If two infinite systems of connexes [¢], [W] are each finite and 
complete then the system [1] derived therefrom by transvection is finite and com- 
plete. 
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To prove the finiteness we have only to arrange the transvectants of [7] in 
an ordered sequence arranged according to ascending degrees, those of the 
same degree being ordered according to ascending grade of the product UV, 
where the typical transvectant of the sequence is 

r= (U,V 
Since the difference between any two terms of 7 is a sum of terms of trans- 
vectants, 

7= (U,V 
where UV is of lower grade than UV, a set of connexes in terms of which all 
others of [7] may be rationally expressed may be obtained by selecting terms 
of the transvectants in this sequence, and precisely one term from each 
transvectant which contains no reducible term. But the number of trans- 
vectants in [7] which contain no reducible terms is finite. 

To prove completeness let the finite set described above be Ki, ---, Kz, 
and let P be any monomial form obtained by convolution from 


P = Kt... K*, 


Then P is a term of a transvectant 


(U,V 


and in which U is obtainable by convolution from [¢] forms.alone, and V 
from [y] forms alone. Hence 


P= ut(U,V)bi + Soul (U,V) 
But [¢], [W] are, by assumption, complete systems, and consequently all 
transvectants on the right belong to the system [7]. Hence [7] is complete. 
2. FUNDAMENTAL SYSTEM OF A CONIC AND A BILINEAR CONNEX 


We let F = p? = qi = --- be a quadric, L = (pqu)* its contravariant, 
D = (pqr)* its discriminant . The connex f = a; a, = 6:8, = --- has the 
fundamental system* 


f,fi=a2Bub,, h=Buryu(acu)d,, 
t= 4,, = ag b,, lg = dg b,c. 
The simultaneous system of F and f is the transvectant system given by 
fifig? 
together with the invariants 7, 7%, 72, D. There are, in addition to the 


*Clebsch and Gordan, Mathematische Annalen, vol. 1 (1869), p. 359. 
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members of the two individual systems, precisely 92 transvectants of this for- 


mula not immediately reducible by theorem (2) of § 1. 


This is exclusive of 


the large number which are reducible by the known syzygy 


— it” uz + fo 

if - 
| 

| 


t’ ths + fe 
f 
), 


fe 


Uz 


Uz 


fo =fi uz. 


Of the 92, twenty-three are reducible by the identity 


4, 


(abu) (pox) = b, b, 


as 
(6) 
Pu Gu 


which is applicable in most cases where | > 0. 
arranged in the following table according to 


az 
bz |, 
Uz 


The remaining 69 forms are 
ascending grade of UV, in 


accordance with the theory. 
TABLE 


Grade 4 


(F, foo = (pau) ay pz, 

(F, fidoo = Pp Pr Gz, 

fi)oo = Pp ba (pau), 
(L,f)or (pqu) (pqax)az, 
(L,f)ti = (pga) (pqex), (L, fi)ty = ba(pqa) (pqu) Bu, 
(L, fidor = ba(pqu) (pqBe)az,  (L, fi)it = b.(pqa) ( 


Grade & 


(F, 
(L, g 00 


20 
Grade 6 
10 


(FL,fi 10 
(F, f? yoo 
(F, f 
(F ffi 
(L, f? 
(L, fi) 

(L, ffi 


(F, foo = Pa Pz ez; 

oo = (pau) pa, 
(F, fi b, (pau) Bu pz; 
(L, = (pqa) (pqu) au, 


a Pp Py (acu), 
a (pga) (pge) (Byx). 


(F, = ba pp (acu) Yu pz; =b 
(L, = ba (pga) (pqu) (Byx) ez, =b 
(FL, fio 
(Ff? Yoo 
(PF, 
(F, ffi 
(L, f?) 2 
(L, fi) 

(L, ffi 


= b. pp (gra) (qru) pz, 
(pau) (pbu) au Bu; 

bd, (pau) (peu) Bu 
= b, (pau) (peu) Bu Yu, 
( pqax ) ( pqBx ) az bz, 
b,d, ( pqBx ) ( azcz, 
b, ( pqBx ) ( pqyx) az ez. 


= pa(qra) (qru) pz, 
Pa Pp az 
b. Pa Psd, Az Cz, 
= Pp Py Cz; 
= (pqa) (pqb) au Bu; 
= b, d,(pqa) (pqe) Bu bu, 
= b,(pqa) (pqe) Bu Yu; 


00 
20 
00 
20 


00 
02 
00 
02 


| | 
| = 
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Grade 7 


(F?, 9 oo = ba (pau) (peu) (qByu) qe, 

(F?, = ba Pp Py(qae) qr, 

= b.(pqa) (pqe) (rsBy) (rsu), 
h)os = b.(pqBx) (pqyx) (rsacx) (rsu), 
(LF, = 6.(qre) (qrBy) (pau) pe, 

(FL, = Dp p, (qru) (grace). 


Grade 8 


(FL, = Pa Da (gra) (qru) bz, 
(FL, = Pa Pp (gra) (grb), 

(FL, = pa(qra) (grb) pz Bu, 
(FL, fi)io = Dp Psd, (gra) (qru)ez, 
(FL, = b Ps Ps d, (gra) (qre), 
(FL, = Pp d,(qra) (gre) 5, pz, 
(FL, ffi b, Ps Py (gra) (qru)ez, 
(FL, = b. Pp Py (gra) (gre), 
(FL, = ba Pa (gra) (gre) Yu 


Grade 9 


(F?, fg)oo = 5. (pau) (peu) (qdu) (qByu) 
(F?, fh)o, = ba Pa Py Qs (qac) dz, 
(F?, fig)oo = bes (pau) (peu) (qdu) (qByu) €u, 
(F?, = Pp PyGe(gac)dz, 
(L?, fg) = b.(pqa) (pge) (rsd) (rsBy) 
(L?, = b. (pqBx) (rsbx) (rsacx) dz, 
fig)io = ba es (pga) (pqe) (rsd) (rsBy ) eu, 
= es (pax) (rsex) (rsacx) dz, 
(FL, fg)25 = ba (rd) (qrBy) (pau) (peu) du, 
(FL, fh)o2 = b. Ds py (qréx) (gracx) dz, 
(FL, = b. es (qrd) (qrBy) (pau) (pew) €u, 
(FL, = ba es Da Py (Grex) (qracx) dz. 


Grade 10: No irreducible forms 
Grade 11 


(F°L, fh)i, = ba Dp Py (qac) (rsd) (rsu), 

(FL*, = b.(qra) (qre) (std) (stBy) ps De, 
(F°L, fih)io = ba €s Pp Py Ye (gac) (rsd) (rsu), 
(FL?, fig)i} = es (qra) (gre) (std) (stBy) p. pe- 


= 


[October 
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Grade 12 
= €s (paw) (peu) (qdu) (gfu) (rByu) (requ), 
(L*, 9?) = b. es (opa) (ope) (grd) (grf) ( stBy) (sten), 
(F*, h? = De Py Ve (rac) (rdf), 
= ba es (opBx) (opyx) (qrex) (qrnx) (stacx) (stdfx) , 
L, = ba es (rsen) (paw) (peu) (qdu) (gfu), 
(FL?, = es (qrd) (qrf) ( stBy) (sten) (pau) (peu), 
(F? L, = es Dp Py Ve In (rsacx ) (rsdfx) , 
(FI?, = ba €s Dp Py (qrex) (qrnx) ( stacx) (stdfx). 
In order to illustrate the process of reduction of the transvectants which 
are reducible by (6) but not by theorem (2), § 1, let us consider 
t= (FL,h)}. 
This has the term 
t = b, pg (acu) (qru) (qryx) pz, 
and 
| (ara) a, dz 


cy 
Yu Ue 
We treat each term of this ¢ separately. Thus we deduce 
(qra) (qru) bs Pp Pz ty Us = t+ Uz (FL, fi)io, 
(gre) (qru) Yuba Pp = (Fs fidoo 
(qru)? b, a, Pg P2 Cr = L+ (F, a, Bu Cz ba) 
In the latter case we have* 
Bub. Ce = + Qe — 3it1) — f — — if). 
Hence (F, a, 8. ¢zb,))) is reducible in terms of invariants and 


(F, fi F. 


Furthermore, 


1 


(qru)*c,b. Pp =t-L- (F,fi 


(gra) (qru) Pp =f (FL, fi)is, 
(gre) (qru) b, ay Pp Pz Uz = Us * (FL, a, By cz 
The latter form is reducible in terms of uz, (FL, f)i$, (FL, fi)i$, and in- 


variants. 
We have now proved that the term ¢ of + is reducible. 


* Clebsch and Gordan, loc. cit., p. 374. 
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A different procedure is required in the case of transvectants of the type of 


r= 


This form 7 has the term 


t = (acu) (pgBr) (pqyr)b., 
and ¢ equals 


(pqa) dg ar 


b.(payx)|(pqe) Cp Cz’. 
Ba 


Expanding the latter determinant we have for one of t’s terms 

t = (pqa) (pqyx) Bu Cz ba, 
and ¢, is a term of the transvectant 

m1 =(L, fh 
Another term of 7; is the reducible form 
te = (pga) ba =f + (L, 
and we have, in accordance with the theory of § 1, 
ty — te = (pqa)b.cz | Bu — yu) 
= (pqa)(pqu) (Byx) ba — (pga) (pgBy) cz Us. 

The terms of this last difference are terms of the respective transvectants 

(L,g)io, (L,g)2- us, 
and these are retained in the above table. Thus ¢; is reducible, in the form 


where > is a sum of terms of transvectants u‘(L,g)j, g being derived by 
convolution from g and hence of lower grade than g and thus linear in f, fi, 
u, and invariants. The other terms of the expanded ¢, with minus signs 
changed, are 
(pqa) Uz = (L, eg Uz 
= (L, a, Buz Ue, 
(pace) (pave) az Bub. = fi(L, 
00 


(pqu) (payx) dg bg = (L,f)or, 
(pqu) (pqyx) az Cg b, (L, az b, Cg 


00 


(pqe) (pqyx) dg b.u2 =u (L,f)in. 
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Hence the term ¢ of 7 is reducible in terms of forms of lower degree, together 
with forms which are retained in the table and forms which are terms of trans- 
vectants for which the grade of UV is less than the grade of Lh. Hence 
t = (L, h)2 does not belong to the fundamental system. 

The remaining 21 reducible transvectants are as follows: 


(FL, (FL, fide, (FL (FL, (FL, 
(FL, (FL filo, (FL FIG, (FL, 
(FL (Fg), (Lik), (FL, 
(FL, (FL, h)e, (FPL, fg), (FL*, (PL, figdtt, 
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ALGEBRAIC PROPERTIES OF SELF-ADJOINT SYSTEMS* 
BY 


DUNHAM JACKSON 


The general definition of adjoint systems of boundary conditions asso- 
ciated with ordinary linear differential equations was given by Birkhoff.f 
In a paper of Bécher,tf in which the idea is further developed, there is obtained 
a condition that a system of the second order be self-adjoint. It is proposed 
here to extend the discussion of this problem to the case of systems of any 
order.§ A condition for self-adjointness of the boundary conditions is simply 
expressed in matrix form, without any requirement that a corresponding prop- 
erty be possessed by the differential equations; the condition gains in sym- 
metry if it is assumed that the differential expression which forms the left- 
hand member of the given differential equation is itself identical with its 
adjoint or with the negative of its adjoint. 

As a preliminary, it will be well to recall a well-known rule for the combina- 
tion of matrices, which will be found particularly convenient. Let a1, ae, 
a3, and a, be n-rowed square matrices. The square matrix of order 2n which 
is made up of the elements of these four, with the elements of a,, arranged 
in order, in its upper left-hand corner, and the elements of the other matrices 
correspondingly disposed, may be indicated by the notation 


a= 

Q3 

* Presented to the Society, December 27, 1915. 

+ G. D. Birkhoff, these Transactions, vol. 9 (1908), pp. 373-395; p. 375. 

tM. Bocher, these Transactions, vol. 14 (1913), pp. 403-420; p. 409. Cf. E. 
Bounitzky, Journal de mathématiques pures et appliquées, 6th 
series, vol. 5 (1909), pp. 65-125; pp. 112-116; W. Stekloff, Sur certaines questions d’analyse 
qui se rattachent & plusieurs problémes de la physique mathématiqu, Mémoires de 1’A- 
cadémie Impériale des Sciences de St.-Pétersbourg, classe physico- 
mathématique, 8th series, vol. 31 (1913), no. 7; pp. 5-6, 69. 

§ Bounitzky, in the paper just cited, pp. 107-116, gives a discussion of the problem, laying 
stress primarily on the case of a self-adjoint differential equation. His method of treatment 
is different from that employed here, and in fact bears as little resemblance to it as could 
well be expected, considering that the author’s task in either case, as far as this particular 
problem is concerned, is merely to give a concise account of algebraic processes which in them- 
selves are perfectly straightforward. 
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Let 8 be another matrix of the same sort, consisting of the elements of the 
n-rowed matrices 6;, ---, 84. The notation suggests the symbolic product 


as a4/\Bs Bs a3 Bi +0183 a3 Bo + ag 


formed as if the elements indicated were ordinary numbers instead of matrices. 
By actually writing out the n-rowed matrices a; 8; + a2 B3, etc., it is possible 
to interpret the product as a square matrix of order 2n; and it is found that 
this is the same as the product obtained by multiplying together a and 8 as 
2n-rowed matrices in the usual way. 

To begin the discussion of the problem in hand, let Z(w) stand for the 
differential expression 


L(u) = po(2) + (2) + + 


where n is any positive integer. The coefficients p are real or complex func- 
tions of the real variable x, defined in an interval a S x Sb, and it may 
be assumed, with more regard for simplicity of statement than for ultimate 
generality, that they are continuous together with their first n derivatives. 


With the differential equation 
L(u) =0 


there is supposed given a set of n linearly independent boundary conditions 
U;(u) = 0,7=1,2,---+,m, in which the general left-hand member is a 
linear combination, with real or complex constant coefficients, of the 2n 
quantities u(a), u’(a), ---, uw" (a), u(b), uw’ (b), Let 
n of these quantities be chosen in such a way that the determinant of the 
corresponding coefficients is different from zero. The possibility of such a 
choice is a consequence of the assumption that the n conditions are linearly 
independent. The quantities selected will be denoted by ™, we, «++, Un; 
the remaining m quantities by wn41, Wen; the order of the subscripts within 
each group is immaterial. It is most natural to assign the first n subscripts 
to u(a), «++, u™ (a), or to u(b), ---, (bd), if the corresponding 
sets of terms are linearly independent, but this will not always be the case. 
The differential expression adjoint to L (w) is 


n— 


d” qr 
M(v) = (— 1)" (por) + (— 1)"* (Pi?) + + 


a” 


It is connected with L(u) by Lagrange’s identity, 


oL(u) —uM(v) = £ v), 
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where P(u,v) is a bilinear form in u(x), u’(x), ---, (a); v(2), 
v' (x), --+, v0" (2), with coefficients which are functions of x. Integrated 
with regard to x from a to b, this identity gives a relation involving the values 
which the functions just named take on at the ends of the interval. If she 
values of v and its derivatives at a and b are denoted by 1, ---, tn, in such 
a way that w; and 2; refer to derivatives of the same order taken at the same 
point, the relation becomes 


— uM dx = Il (u,v), 


in which II (u,v) is a non-singular bilinear form in the 4n quantities u;, v;, 
with constant coefficients. * 

The definition’ of the adjoint boundary conditions proceeds now as follows. 
In addition to the linear forms U,(u), ---, U, (uw), there are defined n other 


forms Unii(u), Uon (uw), which are arbitrary, except for the restriction 
that the whole set of 2n forms shall be linearly independent. Then 2n linear 
forms V;(v), ---, Von(v), in the variables v;, are determined so that 


Il(u,v) = Ui(u) Vile). 


The last n of these forms, set equal to zero, express the adjoint conditions. 
It is well known that a different choice of the arbitrary forms Un4;, ---, Un, 
would lead to a set of conditions equivalent collectively to those already 
obtained, that is, a set of independent linear combinations of them.f It is 
to be kept in mind throughout that it is the set of equations that is important, 
and not the individual equation, still less the individual linear form. 

The definition just given can be expressed in slightly different words, as 
follows: The variables u are subjected to a linear transformation u; = U; (2), 
and then a linear transformation of the other set of variables, 1; = V;(v), 
is determined so that the two transformations together carry over the bilinear 
form II (u, v) into the normal form }°u; v;. Let II denote the matrix of the 
coefficients in the form II (uw, 7); the double use of the letter II, to represent 
a matrix and to represent an operator, will not cause any confusion. Let a 
denote the matrix of the coefficients in the inverse transformation, by which 
the u’s without accents are expressed in terms of the u’s with accents, and 
let 6 be the corresponding matrix for the v’s; the conjugate of a@ will be de- 


* It is to be noticed that the coefficient p, is without effect on the form of II, as the term 
p» uv cancels from the difference vL (u) — uM (v). Consequently, if the differential equa- 
tion with a parameter, L (uw) + Au = 0, is under discussion, the form II is independent of 
the parameter, and the same is true of the adjoint boundary conditions, which depend on II 
for their definition. 

t+ Birkhoff, loc. cit., p. 375; Bécher, loc. cit., p. 405. 
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noted by a’, and, in general, conjugate matrices will be indicated by accents. 
Finally, let J denote the unit matrix of order 2n, having unity for each ele- 
ment in the principal diagonal, and zero everywhere else. Then the relation 
connecting these matrices is simply the equation* 


(1) ae’ = I. 


It is to be noticed that not a, but a, is to be regarded as given directly, 
and 6” is the matrix that is ultimately desired. An expression for the latter 
is found by multiplying both members of (1) on the right by 6”: 


=a’ Il. 


Hence the problem to be solved may be formulated thus: 

Under what circumstances will the last n rows of a’ II be independent linear 
combinations of the first n rows of a1? 

Let a; stand for the square matrix of the coefficients of the first n variables 
u; in the n given forms U;, and a» for the matrix of the remaining coefficients 
in these forms. Since the notation has been supposed chosen so that ay 
is non-singular, the 2n forms U,, ---, U2, will be linearly independent if the 
last n of them are set equal to wn41, «++, Wen respectively. Then the matrix 


a has the form 
of = ( = 


here J denotes the unit matrix of order n (the use of the same letter to repre- 
sent unit matrices of different orders will not cause any confusion) and 0 stands 
for a matrix composed entirely of zeros. It may be verified at once that 


a a2\far' —aj' a: I 0 
from which it follows that the second factor on the left is the matrix a. The 
product aj‘ a2 will be denoted for brevity by the single letter 5. It will be 
convenient also to indicate the conjugate of the reciprocal of a non-singular 
matrix by a double accent, so that (aj')’ = (a,)7 = a, and = a, a. 
Then we have for the conjugate of a the representation 


a, 
The matrix of the bilinear form will be represented for the present simply by 
the notation 
us us 
( 1 
* Cf., e. g., Bécher, Introduction to Higher Algebra, p. 115. 
Trans. Am, Math. Soc. 28 
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which is applicable to any square matrix of order 2n; with regard to the special 
form of II, more will be said later. The expression for 8 is 
Qa, a, 73 

Now, since a is non-singular, it is possible to express any sets of n quantities 
each, in particular, the rows of the matrix standing in the lower left-hand 
corner of a’ II, as linear combinations of the rows of a;; and the coefficients 
by which this is accomplished are uniquely determined. The question is, 
then, whether the corresponding linear combinations of the rows of a are 
identical respectively with the rows of the matrix in the lower right-hand 
corner of a’ II. It will be seen presently that if this is the case, the combina- 
tions in question will necessarily be independent. 

In general terms, if 7 is a matrix of order n, and there are formed n linear 
combinations of its rows, the coefficients in these combinations forming the 
successive rows of a matrix vy, the resulting sets of m quantities each are 
respectively the rows of the product matrix yy. In the present instance, 
7 is to be determined so that 


(2) yo, = m+7%, 


and the condition for a self-adjoint system is that the matrix y so defined 
satisfy the further relation 

(3) 7a; = 6’ + T2- 

If it does, then 


™ ( I 0 ) Oy 
Il = = 
YO 0 ¥ ay 
If y were singular, the matrix a’ II would be singular, which is certainly not 


the case; it follows that the determinant of y is necessarily different from zero. 
Let (2) be solved for 7, and the result substituted in (3). It is found that 


(4) y= — 8 + mm a7', — 7,6 = 7m. 


In the last equation, 6 has been substituted in the left-hand member for its 
equal, a;' a. The relation (2), defining 7, may be regarded as valid in any 
case. Then either of the relations (3), (4), is a consequence of the other. 
The conclusion may be stated as follows: 

THEOREM. A necessary and sufficient condition that the given set of boundary 
conditions be self-adjoint, is that* 


(5) 6’ 716 — = 13 — Te. 


* Professor W. A. Hurwitz points out that this condition can be written concisely in the 


for m A’ iA = 0, where 
( ) 
0 I ‘ 
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In the cases that are of most interest, the form of the condition can be 
simplified. Suppose first that in the nm given boundary conditions, the sets 
of terms corresponding to one end of the interval are linearly independent, 
so that the values of wu and its derivatives at this point may be represented 
by uw, °**, Un. Then, as no single term of II (u,v) is concerned with both 
ends of the interval, the subscripts in each term that actually appears will 
be both less than n + 1 or both greater than nm, and the matrices previously 
denoted by 73 and 7, will be composed entirely of zeros.* If 6 is replaced by 
the alternative expression a;' a2, the condition for self-adjointness reads as 
follows: 

It appears that in this case a2 as well as a; must be non-singular, since 72 is 
non-singular. On multiplication before and after by a and az’ respectively, 
the condition takes the form 


” | ” | 


the meaning of which can be expressed in still different language. There 
is no loss of generality in assuming, for definiteness, that a, is the matrix of 
the coefficients belonging to the point b. More precisely, it may be assumed 
that 

u(b), U2 u’(b), =u") (db), 


Unti = u(a), = u'(a), =u") (a), 


while 71, +++, Yn have a corresponding interpretation. Let the value of 
P(u,v) for = b be denoted by II;(u,v), and its value for x = a, by 
II, (u,v), so that 


II(u,v) = I,(u,v) — Ie(u,v). 


The expressions II, and Iz are bilinear forms in 1, --+, %1, and 
Untiy Untiy ***, Yon, respectively, and their matrices are 7; and 
— 72. Let each of the n given linear forms U;(u) be regarded as the sum 
of a form U;;(u), involving only the first n of the variables u;, and a form 
Ui2(u), involving only the last n variables. If Il,(w,v) is written as a 
bilinear form in (wu) and Uy, (0), = 1, 2, ---, 7, the coefficients in the 
representation have the matrix a’ 7, a;'. A corresponding remark applies 
to IIz(u,v). It follows that 

If the forms Uj, (u) are linearly independent, the condition for a self-adjoint 
system is that the coefficients by means of which TI, (u,v) is expressed in terms 
of the linear forms Ui (u), Uin(v), be the same as those by which (u, v) 
is expressed in terms of the forms Uj2(u), Ui2(r). 


* The fact that x; and 72 also contain a considerable number of zeros need not be taken into 
account. 
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Now let the restriction on the choice of the first n variables be removed, 
but let it be supposed that when the given differential expression L(w) and 
its adjoint are written with the same function wu as argument, the latter ex- 
pression is identical with (wv) or with the negative of L(w), in other words, 
that L (1) is self-adjoint or anti-self-adjoint. The statement of the theorem 
is again capable of some simplification, for the reason now that the bilinear form 
P (u,v) is skew-symmetric in the one case and symmetric in the other,* 
and the same thing consequently is true of the matrix Il. More generally, 
it suffices for the symmetry of P and II that ZL (~) differ only in the term p, u 
from an expression which is anti-self-adjoint,} since, as has already been 
remarked, this term has no effect on the form of P. 

Let the condition (5) be written in the equivalent form 


(6) 8 6 — + = — 8 6 + 25’ — 


If Il is skew-symmetric, then 7; = — 7, m2 = — m2, and 7 = — 73, and 
it appears that the matrix which forms the right-hand member of (6) is the 
conjugate of the matrix on the left. If II is symmetric, each member of (6) 
is the negative of the conjugate of the other. That is: 

If the differential expression L (x ) is self-adjoint, the condition that the boundary 
conditions be self-adjoint is that the matrix 


6’ 716 — 2746 + me 


be symmetric; if L(u) is an anti-self-adjoint expression, or differs from such 
an expression only in the term of order zero, the condition is that the matrix just 
written down be skew-symmetric. 

This form of statement remains valid, of course, under the earlier hypothesis 
that +, = 0; but in that case the condition simply resumes the form 


6'7,56+7=0, 


since this matrix is required to be symmetric and skew-symmetric at the 
same time.t 
HarvarpD UNIVERSITY, 

CAMBRIDGE, Mass. 

*Cf. L. Schlesinger, Handbuch der Theorie der linearen Differentialgleichungen, vol. 1, 
pp. 70-71, 73-74. 

t This observation is not trivial, as the corresponding statement with regard to self-adjoint 
expressions would be; for example, the expression zu’ + 3u is anti-self-adjoint, while the 
expression xu’ is not. 

t It is necessarily skew-symmetric if L (1) is self-adjoint, without regard to the boundary 
conditions, and becomes symmetric if the latter are self-adjoint, and similarly in the anti- 
self-adjoint case. 


ON THE SECOND DERIVATIVES OF AN EXTREMAL-INTEGRAL 
WITH AN APPLICATION TO A PROBLEM WITH 
VARIABLE END POINTS* 


BY 


ARNOLD DRESDEN 


INTRODUCTION 


In a paper published in volume 9 of these Transactions, formule 
were obtained for the second derivatives of the extremal-integral arising in 
the problem of minimizing or maximizing the integral f F(x, y, 2’, y’)dt. 
In these formule the derivatives were expressed in terms of particular solutions 
of the Weierstrass form of Jacobi’s differential equation. They were used 
in the same paper to obtain further necessary conditions for a minimum of the 
above mentioned integral, when one or both endpoints were variable along a 
curve,{ or when curves with discontinuous slopes were admitted as solutions 
of the problem. It is the author’s aim to use the same general method in 
analogous problems involving integrals of a more general type, and also in 
the cases in which the unknown functions are further conditioned by differ- 
ential and algebraic equations. As a first step in this direction, we obtain in 
the present paper the second derivatives of the extremal-integral arising in 
the theory of the integral 


(1) ( yi’ = dy:/dz), 


and express them in terms of particular solutions of the differential equations 
which correspond to Jacobi’s equations for the simpler problem. The formule 
for these derivatives are then used for finding further necessary conditions to 
be satisfied by a curve C in the space of n + 1 dimensions which is to furnish a 


* The contents of sections 1 and 2 of this paper are contained in the paper presented to the 
Society at the Chicago meeting of December, 1914, an abstract of which appeared in the 
Proceedings of the National Academy of Sciences for March, 1914. 

t Compare Bolza, Vorlesungen tiber Variationsrechnung, p. 306-313 (this book will be re- 
ferred to in the sequel by the author’s name only), and Dresden, these Transactions, 
vol. 9 (1908), pp. 467 and 472. 

t The conditions here referred to had been found before by Bliss, these Transactions, 
vol. 3 (1902), p. 136, and Mathematische Annalen, vol. 58 (1903), p. 70. 
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minimum for the integral (1), when the initial point of C is allowed to move 
along a fixed curve L.* 

A number of familiar facts concerning the integral (1) which will be needed 
in the development of our problem have been collected in section 1. In 
section 2 the formule for the second derivatives of the extremal integral are 
obtained, and in section 3 these formule are applied to the problem with 
variable initial point. 

Concerning the function f , which appears in the integral (1), we assume that 
it is of class C’” with respect to all of its arguments; the functions y (2x) 
which are admitted as solutions of our problem must be of class C’’. We shall 
moreover use the following abbreviated notations: 


fo = Of/dx, fi =Of/Oyi, = Of/dyi, 


and similar notations will be used for the second derivatives. A point in 
n + 1 dimensional space of coérdinates yi, Yn will be‘designated by 
(x; y), and similar notations will be used for sets of numbers which can be 
interpreted as coérdinates of points in spaces of higher dimensionality. Finally 
the notation f” will be used to indicate that the function f(z; y; y’) is to be 
taken with the arguments (2;; yi; yi). 


1. PRELIMINARY DEVELOPMENTS 


Any set of functions y;(2), «++, Yn (2) which is to furnish a minimum or 
a maximum for the integral (1) must satisfy the system of differential equa- 
tions 


known as Euler’s differential equations.t Such a set of functions defines a 
curve in the space of n + 1 dimensions, called an extremal for the problem; 
we shall use repeatedly geometric terminology consistent with this inter- 
pretation of the set of functions y;(x). From the general existence theorems 
for differential equations,t we know that the general solution of the system 
of differential equations (2) is of the form 


(3) Ys = gil a1, On; Bi, Bn) (i =1,-+-,n), 
the functions g; being of class C’” with respect to all of their arguments and 
a1, being 2n arbitrary constants. 


* For the case n = 2, this problem has been treated by Miss M. B. White in her paper, 
The dependence of focal points upon curvature for problems of the calculus of variations in space, 
these Transactions, vol. 13 (1912), p. 175. 

t See Bolza, p. 542. 

t Ibid., pp. 175-179. 
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We suppose now that we have found an extremal passing through the two 
points A (ao; a) and B(bo; b) and satisfying the conditions of Clebsch and 
Mayer.* ‘The second variation of the integral (1) can then be put into the 
form 


2 


where 71, «++, mm are arbitrary functions of x of class C’ vanishing for x = ao 
and for t= bo, and where Px = fa, Ox = fi. n+k and Ri = Sari, n+k» these 
functions being taken along the given extremal. If the integrand which ap- 
pears in this formula for the second variation be denoted by 2Q, and if we write 


ond 
¥i(n) = 
we obtain, by means of Euler’s theorem on homogeneous functions and by the 
familiar integration by parts, the following result: 


The system of equations 
(4) ¥i(n) = 0, (t=1,-+--,n), 


is a system of linear differential equations of the second order in the unknown 
functions 7;; it is a self-adjoint system and is called, after Von Escherich, the 
accessory system of differential equations;{ we observe moreover that it is 
the direct analogue of Jacobi’s equation for the simplest problem of the cal- 
culus of variations. It follows further§ that a fundamental system of 2n 
solutions of the equations (4) may be obtained by differentiating the functions 
gi, Which appear in the equations (3), with respect to each of the parameters 
a,, °-:, 6, and then replacing these by aio, --+, Bao, the latter being the 
values which furnish the given extremal if put in place of a1, ---, 8, in equa- 
tions (3). We obtain in this way the following 2n systems of solutions of 
equations (4): 


(5) , Ogn/Oai; 091/08; , , (i =1,+++,n), 


the arguments of these functions being z; aio, «++, Bno- 

We consider now two points P; (21; y:) and P2(22; y2) in the neighbor- 
hood of A (ao; a) and B(bo; b), respectively. Since the extremal AB satis- 
fies the Mayer condition, we can conclude that it is possible to construct an 

* Bolza, p. 608 and p. 619. 

+t Hadamard, Legons sur le Calcul des Variations, p. 319. 


t Bolza, p. 622; further references are given there. 
§ Bolza, p. 623 and Hadamard, loc. cit., pp. 336-339. 
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unique extremal through the points P; and P2, i. e., that it is possible to solve 
the equations 


(6) Yui = (21301, Bn), Yoi = gi a1, Bn) 


uniquely for a; and 6; in terms of 2, 22, yi; and yo;.* We denote the solu- 
tions of these equations by: 


(7) A; (21, %23 yn, Yon), Bb; = B; (21, Yi, Yon). 


Substitution of equations (7) in equations (3) furnishes the following analytical 
representation for the extremal through the points P; and P2: 


Yi = gi (2; Ai(n, Yon), Yon) ) 
(8) 


= Y; (x; yur, Yon). 


Upon substituting from equations (7) in equations (6), we see that the func- 
tions Y;, which are defined by equations (8), satisfy the following initial con- 
ditions: 


(9) Y; (413 %1, yu, » Yon) Y; 241, 2, » Yon) = Yr. 


Differentiation of these equations with respect to 21, %2, Y11, -** ,» Yon furnishes 
furthermore the following relations, which will be of use in the sequel: 


OY;/dx,|, + Yi(a) = 0, OY;/dx2|, + Yi(ae) = 0, 
(10) OY; = 0, ™ 0, 
dY;/dx,|,=0, dY;/dy1;|, = 0, OY: /dy2;\ 5 by, 


or, expressed in different form, 
09: OA; 99: 


gi (22) * 0B; 


09; OA; Og; 0g: OA; Ogi | 
Ox, 98; { OxX2 96; Ox2 


0g; OA; Og; Ogi OA; 0g; OB; 
1 


0a; OYth 0a; 0B; 


{ 0A; 4 OB; } { $2 Og; OA; Og: OB; 
Ba; dy 3B; Ayu de; dyn * 38; Opes 
(i=1,- --,n;h=1,- oo, %). 


2 


* Bolza, p. 597. 
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We compute now the integral (1) along the extremal P,P: by substituting 
the functions Y;, ---, Y, and their derivatives with respect tox, for y1, +++, Yn 
and y,, ---, y,, respectively in the integral (1). This integral becomes then 
a function of the 2n + 2 coérdinates of the points P; and P2;* it is known 
as the “ extremal-integral ”’: 


I (x1, %2, yu, Yon) = Y;, Yi, Y;,) dz. 


The first derivatives of this function I with respect to its different arguments 
were first given by Hamilton,* and we shall use them as a basis for the com- 
putation of the second derivatives; we have the following formule: 


OI /da, = —f yi) + Yi) 
(12) = f (x2; yo; y2) — Do (25 Yo) 


= — (13 Yi )» = (X25 Yo; y2) 


where y1, y; are abbreviations for the sets Yin) and Yin) 
respectively. 


2. THE SECOND DERIVATIVES OF THE EXTREMAL-INTEGRAL 


For the determination of the second derivatives of the function I it is neces- 
sary to know the derivatives with respect to 21, «++, Yon of the quantities 
Yin and y21, «++, Y2n Which determine the slope of the extremal P; P2, 
at its end-points. If we denote by z any one of the coédrdinates of the points 
P, and P:, we obtain immediately, upon differentiation of equations (8), 


the result, 
7 \Oa; dz OB; Oz 


from which the desired derivatives may be obtained by differentiation with 
respect tox. Furthermore equations (11) enable us to determine expressions 
for 0A;/dz and 0B;/dz; for we have there a set of 2n linear non-homogeneous 
equations for each of the 2n + 2 sets of 2n unknowns which we get if we let z 
assume in succession each of its values. These sets of equations all possess 
finite unique solutions provided Mayer’s condition is satisfied along the ex- 
tremal P; Pz. That this will indeed be the case, if the points P; and P2 are 
taken sufficiently near to the points A and B, respectively, follows from our 
conditions on the extremal AB and from the continuity of Mayer’s deter- 
minant:f 

* Bolza, p. 599. 

t Bolza, p. 611. 
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O91 O91 9 O91 


dan OB: 


99n 
(14) 
O91 | Og: | 991 991 | 


Aan), 
We denote now by A(z, x2) the determinant obtained from A(z, 21) by 
replacing the substitution symbol |; in the last n rows by |2; by A(a1, 22) 
the result of replacing the first n rows of A(z, x2) by the last n rows of 
A(x, 21); by Aj (x; 21, 2) the result of replacing the jth row of A(a1, 22) 
by the kth row of A(z, 21); and by Vj (x; 21, 22) the result of replacing the 
(n + 7)th row of A(21, 22) by the kth row of A(z, 22). 
With this notation we can write down the results of solving equations (11) 
for 0A;/dz and 0B;/dz; substitution of them in equations (13) leads to the 
following formule: 


Aix (x5 


From these we obtain by direct differentiation the derivatives with respect 
to 21, Yon of the quantities yi1, ---, yz, namely: 


OxOy2; ’ 


(15) 
ix (3 1, 


— 2,9: (22) ae) 


From the definition of the functions A;, and Yj, combined with the properties 
of the functions (5), it follows that the functions 


Uj = Aix (2; x2)/A(a1, iE = Vir (2; v1, x2)/A(2x, 


furnish for every fixed j a solution of the accessory system of differential 
equations, and that they satisfy the following initial conditions: 


(16) = Sn, =0; = 0, = Sy. 


These initial conditions show moreover that each of the systems of functions 
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uj, and v constitute, in the terminology of Von Escherich, a “ conjugate 
system ” of solutions of the accessory system of differential equations. * 

We can now express the second derivatives of the extremal-integral in terms 
of these solutions of the accessory system of differential equations. Differ- 
entiation of equations (12) yields, by the use of equations (10) and (15), the 
following theorem: 

TaeoreM I. The second derivatives of the extremal integral for the integral 


Sf lan, Ys ae 
are given by the formule 
nts Yi Vir Urs (21) 
y 


Is, 2 


~ 

nw 
ll 


Jj J 


= fornss n+j Yij + Yik Un; (21), 
jk 


J 

J J 


Is, 2i =f? Yas Yoj Vik (22), 
J 


+ {2 n+j Y2; Y 2k (2X2 ) ’ 
J J 


* See Bolza, p. 627; Hadamard, loc. cit., p. 339; also von Escherich, Wiener Berichte, 
vol. 107 (1898), p. 1222. 
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where the functions us, and vy, are the solutions of the accessory system of differ- 
ential equations (4) which are characterized by the initial conditions (16). The 
subscripts of I specify the variables with respect to which the differentiations 
are made, single subscripts designating the variable zx , while pairs of subscripts 
designate the y’s. Thus = 0° I/dx2 

From the continuity restrictions imposed upon the function f, it follows that 
the results of differentiating the extremal integral should be independent of 
the order of differentiation and hence that the two different expressions given 
for certain of the derivatives in formule (17) should be equal to one another. 
This fact leads to eight relations between the expressions given in theorem I. 
It is important for certain applications and as a check upon the accuracy of 
the formule themselves, that these relations should be derived directly from 
the properties of the functions u,, and v%. We shall denote by (A), (B), 
(C),(D),(E), (F), (@), and (1/7) the relations resulting from setting equal 
to one another the two expressions occurring in each of formulz (173) to (1710) 
respectively. 

In the first place we observe that certain of these relations follow from 
others; the remaining ones will then be proved to hold as a consequence of the 
properties of the functions u;, and v;,. If in (G@) we interchange the indices 
i and j, then multiply through by y;; and sum with respect to j, and finally 
interchange the indices j and k in the second sum, we obtain (C). If (G@) is 
multiplied through by y:; and then summed with respect to j, and if finally 
the summation indices in the first sum are interchanged, we obtain (D). 
Next, we multiply (D) through by yi;, sum with respect to 7 and replace the 
indices 7, j7, and k in the second sum by k, j, and 7, respectively; in this way we 
obtain (A) .* 

The functions y;(2) which occur in our formule are solutions of Euler’s 
differential equations (2). Substituting these functions in the differential 
equations and carrying out the differentiation in the second member, we obtain 
the following identity: 


(18) fo, n+i + + Sui, n+k Yu 


We now multiply (/') through by y;;, sum with respect to j, interchange the 
indices j and k on the left side and add to it equations (18) with 2, substituted 
for x; in this way we obtain (B). Using y:; and x2 in analogous manner en- 
ables us to show that (#7) implies (£). It remains therefore to prove (G), 
(F),and (H). 

To prove the last two relations, we remember that wu; , and also vj, are con- 
jugate systems of solutions of the differential equations (4); hence we havef 


*(A) may ‘also be derived from (C). 
t Bolza, p. 626; Hadamard, loc. cit., p. 319 
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where u; and wu; designate the systems (uj1, Ujn) and (uy, +++, Win), 
and where 


dQ 02 
vinst) = >| 28]. 


k 
The equation y¥(u;; u;) = 0 reduces to 
+ vin) — Uit + Ric uje) = 0. 


If, in this relation we put x equal to x; and make use of the initial conditions 
to which the functions uj are subject, we obtain 


Qi + Rij Vix = Qi + Rei 


which is equivalent to ( F) in virtue of the meaning of the symbols Q;; and R;;. 
The corresponding treatment for the functions 2, will prove the relation 
(H). Finally we have for any two systems of functions uy, and »; the 
relation* 


( Wik + Rei Wik ) Uit ( Vik + Ru Vik) = const. 

If we put x successively equal to 2; and to 22 and set the two resu'ts equal 
to each other, making use at the same time of the initial conditions on the 
functions uj and 0, we obtain (@). 

3. THE CASE OF ONE VARIABLE ENDPOINT 


We suppose now that the endpoint P; of our extremal P; P2 is allowed to 
move along a curve L in the space of n + 1 dimensions, whose equations are 


(19) y=m(t), 
We suppose that this curve has no singular points, i. e., 
(20) (th+ Vin(t)P+0, 


and that it is of class C” in the interval (¢’ t’’). 

We assume furthermore that we have found an extremal E 
(21) = yi(2), m1 
which satisfies the following conditions: 

I. The condition of Clebsch:} 


~ * Bolza, p. 626; Hadamard, loc. cit., p. 319. 
t Bolza, p. 608. 
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At every point of the extremal the quadratic form >> Rx ¢;{; must be 
ik 


positive, i. e.: 
(22 Rix Se > 0, 2S 
t 


II. The condition of Mayer:* 


where 2; is defined as the root next following upon 2; of Mayer’s determinant 
(14). The point P; determined upon E by 2; is known as the conjugate point. 

III. The value ¢, of ¢ furnishes on L the point where it intersects the extremal 

E; i. e., 
4) =a, mi = ni(t) = = 

IV. The function f does not vanish at the point of intersection of the curves 
Land E;i. e., f? =f (a3 yi) 0. 

We ask now what further conditions the extremal EF must satisfy in order to 
furnish a minimum for the integral (1) among all admissible curves which 
join a point on ZL to P,. For this purpose we compute the extremal integral 
along E. Since the coérdinates of P2 are fixed and the codrdinates of P; are 
all functions of the single variable ¢, the extremal integral will become a func- 
tion of ¢ alone, 


I (a1; yo) = 1( E(t); 0(t); 223 ye) =T(t), 
which must satisfy the ordinary conditions for a minimum at ¢,, namely, 
I’(t:) = 0, 


By the use of formule (12), we find: 
ol ol , 
I’ (ti) = + = (Et ys — 


Its vanishing requires, as is well known, that Z must cut the extremal £ trans- 
versally at P,. This condition of transversality, together with condition 
IV implies that 


(23) +0, 
where ¢; = £1 — mi- 
A first value for I’ (t) is given by the formula 
T (t) + + +2 Ni 
(24) 


+> 


* Ibid., pp. 610-619. 


= 
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In the expressions (17) for the second derivatives of J we replace the solutions 
uu; of the Jacobi equations by the fundamental system of solutions ¢,, whose 
matrix reduces at x = 2, to the identity matrix. Then it follows that 


where Z(2x) is the determinant (r =n +1, ---, 2n;k =1,-+--, 


and Z% (x) is this determinant after its kth column has been replaced by 
fj1,°°*, Sin. With the help of (17,), (174), and (17s) equation (24) then takes 
the form 

I’ = A(a1) + 22), 


where 
A(m1) = —fOH —fPE + mi) 


B (21, 22) = >> (ao) Riz (21) 


1 
Z (x2) Fe 
We investigate now the behavior of the function B(x) = B(a,,2) as x 
varies from 2; to 2,, merely sketching in the briefest way, the proofs of the 


results. We find 


1 
(2) = Ra) Bi 


where R(x) is the determinant |Ry| and 
k 


@5) Rar (a1) Zit pre Zits Rij (21), 
in which Z;; is the cofactor of ¢,: in Z(a2), and p;, the cofactor of R;,(x) in 
R(x). The last expression for 8;; is derived from the first by applying suc- 
cessively formulz (70), (65), and (55) in Chapter XII of Bolza’s Vorleswngen, 
and remembering that the function y of formula (55) is a constant which can 
be evaluated at 2,, where the values of the functions {,, are prescribed. In 
these formule, the solutions z, u of Jacobi’s equations are to be replaced by 
solutions of the matrix ¢,, defined above; and for our case the multipliers p, r are 
all equal to zero identically, since there are no adjoined differential equations. 

The quadratic form )> Bj; ¢;¢; is positive, on account of condition (23) 
together with the fact that it is derived from the quadratic form }> pi. 212s 
by two linear transformations whose determinants are equal to Z and R 
respectively. We have Z + 0 because the Mayer condition is fulfilled and 
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because the Mayer determinant (14) reduces to Z when the functions (5) are 
expressed linearly in terms of the functions {,,. Further R + 0 on account of 
condition I, and since the matrix | ;,| is the matrix of the cofactors of R, the 
form >> pis21 2, is positive. If we now apply condition I once more, we 
conclude that B’ (x) is steadily negative on the interval 2; < x < 2}. 

The initial values of the functions ¢,, at x = 2; are known, and hence the 
first few terms of the Taylor expansions for the numerator and denominator 
of B(x) can be found. It is found that in the neighborhood of z;, we may 
write 


1 


so that B(2) approaches + © as x approaches 2, from the right. 
Finally, setting x equal to x; in the two expressions for 8;; given in formula 
(25), we obtain, since Z(2x;) = 0, 


R (2) Riz Z’ ZG) = Riz (1) Ze Pls Zits Ry (x1), 


the argument x being everywhere replaced by 2;. It follows from this that 
the numerator of B (2; ) reduces to 


1 
Bis 


which is different from zero; further, since the denominator vanishes, B(x) 
becomes infinite as x approaches 2x; , and it must approach —  , because B’ (zx) 
is negative. 

From these facts it follows that the function B(2,, x) decreases univari- 
antly from + © to — © as 2 increases from 2; to 2;. Hence there is an 
unique value 2; between 2; and x; such that I’ (t,) is positive for every value 
satisfying the condition x; < < The point defined on E by 21 
is called the “ focal point’ of Lon E. We have therefore the following the- 


orem: 

TuHeoreM II. Jf an extremal arc E with the properties I-IV minimizes the 
integral (1) with respect to arcs joining a fixed curve L to a fixed point P2, then 
it must be cut transversally by L at their point of intersection P,, and it must 
further be such that the focal point of L on E is not between P; and P3. 
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DEFORMABLE TRANSFORMATIONS OF RIBAUCOUR* 
BY 


LUTHER PFAHLER EISENHART 


When a system of spheres involves two parameters, their envelope consists 
in general of two sheets, say 2 and =,, and the centers of the spheres lie upon 
a surface S. A correspondence between = and 2; is established by the points 
of contact on the same sphere. In general the lines of curvature on = and 2, 
do not correspond. When they do, we say that is in the relation of a trans- 
formation of Ribaucour with 2, and vice versa. For the sake of brevity 
we call it a transformation R. 

It is a known property of envelopes of spheres that if S be deformed and the 
spheres be carried along in the deformation, the points of contact of the spheres 
with their envelope in the new position are the same as before the deforma- 
tion. Ordinarily when S for a transformation R is deformed, the sheets 2’ and 
>; of the new envelope are not in the relation of a transformation R. Bianchif 
has shown that when S is a surface applicable to a surface of revolution, it 
is possible to choose spheres so that for every deformation of S the two sheets 
of the envelope shall be in the relation of a transformation R. From the 
equations which Bianchi used to prove this result it can be shown that if 
S undergoes a single deformation so that the sheets of the new envelope are 
in the relation of a transformation R the conjugate system on the deform of S 
corresponding to the lines of curvature on the envelope is the same as before 
deformation. It is known that a conjugate system may be preserved in an 
infinity of deformations, or one, or none, the latter being the general case. 
It is the purpose of this paper to determine the transformations R whose 
surfaces of center admit one deformation into surfaces of center of trans- 
formations Rh. 

It is shown that the only surfaces = admitting such transformations R 
have the same spherical representation of their lines of curvature as isothermic 
surfaces, and that every surface of this type admits such transformations. 

The conjugate system on the surface S is a system 20, to use the notation 
of Guichard. A conjugate system remaining conjugate in a single deformation 
is called a permanent conjugate system. The present investigation carries 
= Presented to the Society, at Cambridge, Sept. 4, 1916. 

} Differenziale geometria, vol. 2, p. 117. 

Trans, Am. Math. Soc. 29 437 
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with it the determination of all permanent conjugate systems which are 20. 
It shows also that any conjugate system having the same spherical repre- 
sentation as a permanent system 20 is of this kind also. 

Evidently when one sheet of an envelope of spheres is a point or a plane 
the transformation is of the type R. Hence a special class of the solutions 
of our problem include all transformations R whose surfaces of center may 
be deformed so that in the new position the spheres pass through a point or 
are tangent to a plane, which we have shown to be respectively transforma- 
tions D,, of isothermic surfaces,* and transformations E,, of surfaces with 
isothermal representations of their lines of curvature. 

When our general results are applied to the transformations D,, of isothermic 
surfaces, we are led to the transformations 7, of these surfaces discovered 
by Bianchift and expressed intrinsically. The present results give a geo- 
metrical basis to these transformations and other observations made recently 
by Bianchi.§ 


1. TRANSFORMATIONS OF RIBAUCOUR 


Let 2 be a surface referred to its lines of curvature, « = const., » = const.; 
2, Y, 2, its cartesian coérdinates; X, Y , Z, the direction-cosines of its normal; 
Pi, p2, its principal radii of normal curvature. These quantities satisfy the 
following equations of Rodrigues:|| 


Ox Ox ox 
(1) au 9, = 9 = 


Darboux §] has shown that if \ and yw are any two functions satisfying the 


equations 
(2) au Pay, = 9 = 0, 


the parametric curves form a conjugate system, and that the spheres with 
centers on S and whose radii are given by \/y give a transformation R of Z, 
and the most general one. We shall now find the expressions for the cartesian 
coérdinates x;, yi, 2: of the transform 


*Rendiconti dei Lincei, ser. 5, vol. 24 (1915), p. 351. 
tAnnals of Mathematics, ser. 2, vol. 17 (1915), pp. 64-71. 
tAnnali di Matematica, ser. 3, vol. 12 (1906), pp. 19-54. 
§Rendiconti dei Lincei, ser. 5, vol. 24 (1915), pp. 377-387. 

|| E., p. 122. A reference of this sort is to the author’s Differential Geometry. 
q Lecons, vol. 2, p. 383. 


on the surface S whose codrdinates 29, yo, 2 are given by 
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If X’, Y’, Z’; X”, Y”, Z” denote the direction-cosines of the tangents to 
the curves » = const., « = const. respectively on 2, we have* 


»__1 1 oe 
VE Ou’ “= NG dv’ 
ax’ VE ax’ 
(4) 
aX" aX" __ Wy 
P1 dv p2 


The equation of the sphere of radius \/y and center (3) is reducible to 


where 21, yi, 21 are current codrdinates. The equations obtained by differ- 
entiating this equation with respect to u and » are 


(a — 2)? +2.VE > (2, — 2) =(Q, 


(a, — 2)? + AVG (a1 — x) = 0. 


These three equations may be replaced by 


1 
(5) — = (aX’ + BX" + pX), 
where m is a constant, and the functions a, 8, ¢ are subject to the conditions 
(6) a? + + = 2mdo, 
and 


Comparing equations (1) and (2), we see that d is a solution of the equation 
satisfied by z, y, z, that is, the point equation for 2. Expressing this con- 
dition, we find the expressions for da/dv and 08/du in the fundamental equa- 
tions (I) below. The third and fourth of equations (I) below follow from the 


+ Algebraic signs are given to these direction-cosines so that we have 
x” = 1 
Y 


On Or 
« VE, B VG. 
p. 157. 
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first two and (2). The functions & and / in the fifth and sixth of equations 
(I) may be taken as defined by these equations. The expressions for da/du 
and 08/dv are a consequence of the differentiation of (6). The fundamental 
system is 


dn 
>, = VE ’ 5, = ves, 

dloge loge _ 

da 1 avE VE 

(I) 

da 1 1 davE 

Oo VE du av ~ 

ap 1 avG VG 
ak 1avE 8B 

al 1 « 

=k | Ou 


The last two of these equatidns must be satisfied in order that the con- 
ditions of integrability of the derivatives of a and @ shall be satisfied; in 
deriving them we make use of the Gauss and Codazzi equations for 2 , namely 


dv\ VG dv du\ VE Ou pip2 =” 
(7) at 
(ME) 2 188 
Ov Pi Ov ’ Ou ou 


Equations (I) form a completely integrable system of differential equations 
and each set of solutions satisfying (6) gives a surface =,, which, as we shall 
show, is a Ribaucour transform of 2. 

From (5) we have by differentiation 


(8) —kX,, IX", 


) 
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Similar equations hold for the y’s and 2’s. Now X,, Y,, Xj, 
are the direction-cosines of the tangents to the curves » = const., wu = const. 
respectively on 2,. From these we find that the direction-cosines, X;, Y;, Z1, 


of the normal to 2; are of the form 


(10) + (1-4 


If E,; and G, are the first fundamental coefficients of 2;, we have on com- 
paring (8) with (4) that 
(11) VE, =-k, VG, =1. 


On differentiating (10) we get 


ox, 1 Oa at Ox, 1 02; 
Ou Ov pi Ov 


where pi; and py. are given by 


=0. 
po 
Since the preceding equations are analogous to (1), we know that the lines of 
curvature on 2; are parametric, and that pi; and py, given by (12), are the 
principal radii of normal curvature of 2;. Hence 

THEOREM 1. Every set of solutions of equations (I) which satisfy the quad- 
ratic relation (6) determines a transformation R of >. 

In consequence of (10) equation (5) can be given the form 


2. THE TRANSFORMATION R OBTAINED BY THE DEFORMATION OF S 


From (1) we find 


where 
B B 
12 
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where we have put 


1 
po 
Hence the first fundamental coefficients of S , whose coérdinates are given by 
(3), have the form 


2 
(16) =1i(1+(2)), = 1(1+(8)). 


When S is deformed into the surface of centers S’ of another transformation 
R, the functions of the latter must necessarily give equivalent expressions 
for Eo, Fo, and Go. Since the spheres are unaltered in size, the functions 
and p’ are given by 
(17) = pr, = ph, 
where p is a factor of proportionality.* 

We call 2’ and ¥; the two sheets of the new transformation R and indicate 
by primes all functions belonging to the latter. From (14) and analogous 
equations we have 


vE“(1+2*) (1+5%), 
Pi B 


(18) 
G2(1+2*) - @5(1+2*). 


In order that the expressions (16) may be equal to similar ones for S’, we 
must have in consequence of (18) 


Because of (17) these may be replaced by 
(19) a’ = pa, B’ = — pB.t 


When these values are substituted in equations analogous to the first two 
of (1), we find for the determination of o 


(20) 0 log p = 


log p _ 
Ou 


* By thus fixing the signs of \’ and y»’ we have merely chosen the positive direction of the 
normal to 2’. 

t The choice of these signs is merely equivalent to determining the signs of the square 
roots of E’ and G’ hereafter. 


2 
a’ a 
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Moreover, equations (18) are reducible to 


ve (1 +4%)- 2 x), 
pi 


(142 x) = 42%). 
p2 


If these equations be differentiated with respect to » and wu respectively 
and in the reduction use be made of the Codazzi equations (7) and similar 
ones for 2’, we get 


1 avE’ 1 OvE B 


(21) 


(22) 

1 G’ 1 a Vani 

Ve ou 7 + 


When these expressions are substituted in the Gauss equation for 2’, 
analogous to (7), we are led to the condition 


(23) 2VE’@’ — VE'(l— VG) + V@'(k — VE) + VGk =0. 


If we choose the constant m’ of the new transformation R equal to m, 
it follows from the foregoing results and (6) that 


(24) o’ = po. 


In order that yp’, a’, B’, and ao’, as given by (17), (19), and (24), shall 
satisfy equations for 2’ analogous to (1) we must have 


k’—k+ VE VE’ =0, 
(25) 
Vv +l— VG’ =0. 


Moreover, these equations satisfy equations similar to the last two of (I) 
when (23) is satisfied. 


3. EQUATIONS IN REDUCED FORM. THE INVERSE OF A TRANSFORMATION R 


Equation (23) may be replaced by the two equations 


VE =3 (NE -k) +S +h), 
(26) 


where @ is a function to be determined by the equation (22). Reviewing the 
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preceding discussion, we note that when such a function @ is known, the 
surface 2’ is defined intrinsically by (26) and (21). 
When the values (26) of VE’ and VG’ are substituted in (22), we get for 

the determination of @ 

1 

VG 

36 1 avG 
(VG +l)e e +k) (FOG +0 VE du ). 


These equations are satisfied by e? = 1. In this case 2’ is congruent to 2. 
Another constant solution is e® = — 1. Now equations (26) become 


(28) VE’ = —k, VG’ =. 


Comparing these equations with (11) and (21) with (12), we see that 2’ is 21, 
the transform of 2, and consequently its transform 2, is 2. By means of 
the foregoing results we shall then be able to find the expressions for the 
functions uw, o by which is obtained from In fact, 
from (20) we have, to within a negligible constant factor, p = 1/dc. 


Consequently from (17), (19), and (24) we have 


1 a 1 
It is readily shown that these expressions satisfy equations for 2; analogous 
to (I). 
4. SOLUTION OF EQUATIONS (27) 


In view of the foregoing results, we are concerned with the solutions of 
equations (27) other than e” = 1. By means of (I) these equations are 
reducible to 


where 

— o =~ 
(31) A= log (VE +k), B= log +1). 


The condition of integrability of (30) is reducible to 


(32) e*L—M=0O0, 

where 

(33) FB OB _ PA oi dB 


If L = M = 0, there are an infinity of deformations of the kind sought. 
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Bianchi has solved this problem and hence we exclude it from our considera- 
tion. 
Substituting the expression for e” from (32) in (30), we get 


du\ ~ M) du’ ~ DL) dv’ 


The integrals of these equations are 


L —2B M —2A 
where U and V are functions of u and » respectively. 
Since we have excluded the case e = 1, neither U nor V can be zero. 


The consistency of these equations necessitates 
UV — Ue* — Ve4 = 0,7 
which in consequence of (31) is equivalent to 


- (VG (VE =0. 


Because of the significance of k and | as shown by (11), we know that the 
parameters wu and » of the lines of curvature on = can be chosen so that U 
and V may be replaced by constants, which it is convenient to take as + 4. 
Hence we have for consideration the three cases 


(4) (VE 44%, 44, 
(35) U=-V=4, 
(36) (VE +k (WG +? = 4%, U=—-V=-—4. 


If we restrict our consideration to the case of real transformations of real 
surfaces, we may replace (34) by 


(37) (NE + = 24) 0s +1) = 24)Msin 


where w is a function to be determined and e = + 1 according as \/c is posi- 
tive or negative. Now (31) become 


A = log 2 Ve cos w, B = log 2 ve sin w, 
g 


and from (32) we get 


= — tan? w. 
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These values of A and B satisfy (32) when L and M are given the values (33). 
Since @ is necessarily imaginary, equation (34) does not give rise to a real 
solution of the problem. 

Before considering equation (36), we remark that if we put 


a=a, B=8, M=H, A=\, m=—m, 


equations (I) and (6) are satisfied, and from (5) it follows that the trans- 
form >; is the same as 5;. Hence equation (36) offers nothing different from 
(35), which we now proceed to investigate. 

If we replace (35) by 


(38) VE +b = cosh w, VG +1 = 24)sinh 


where € = + 1 according as \/c is positive or negative, we have 


A=log2vecoshw, B= log2vesinhw, 


and 
(39) e* = tanh? w. 
From (I) we obtain 
(40) (VE +) 5, (VE 
and _ 
a VE+k 1 a 
(41) 
VG+l10vE 1 B 
2 tog (VE +h) = 5 (VG +1)5. 


Substituting the expressions from (38), we get 


dw 1 avG a h 


do 6 
dv av oe 


(42) 


By means of (I) and (40) we find that the condition of nipeuenied of these 
equations reduces to 


8/1 avE 8/1 avG 
(43) av du 


We shall now determine under what conditions a surface satisfies (43). 
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5. SURFACES WITH THE SAME SPHERICAL REPRESENTATION OF THEIR LINES 
OF CURVATURE AS ISOTHERMIC SURFACES 

If the linear element of the spherical representation of 2 be written 


da? = + Gdr’, 
then 
(44) S=piE, G= 3G, 


as follows from (1). Hence the Codazzi equations (7) may be written 


lavs 1 1av§_ 


VG dv WG dv’ Ns Qu VE Ou’ 
and consequently equation (43) may be replaced by 
du\ VG dv J dv\ du 
It is our purpose to show that an orthogonal system on the unit sphere 
satisfying this condition represents the lines of curvature of two isothermic 


surfaces, which are the Christoffel transforms of one another by definition. 
In the case of an isothermic surface, equation (44) may be replaced by 


(45) 


a a 
(46) 
or 
b b 
47 
(47) p2 Ve 
where 
(48) E=G=d, or B. 
The Codazzi equations (7) may be given the form 
Ov — pi OW’ du — po Ou” 


Substituting in these equations from (46) and (47), we have respectively 


dloga 1 dloga 1 
(49) 
log b 1 log b 1 


Equation (45) expresses the necessary and sufficient condition that these 
equations shall be consistent. Conversely, when the spherical representation 
of the lines of curvature of a surface satisfies (45), one obtains by quadratures 


448 
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from (48) and (49) the first fundamental coefficients of two isothermic surfaces 
with the same spherical representation of their lines of curvature as the given 
surface. Hence we have 

THEOREM 2. The necessary and sufficient condition that a surface be iso- 
thermic or have the same spherical representation of its lines of curvature as an 
isothermic surface is that condition (43) be satisfied. 


6. TRANSFORMATIONS R OF THE TYPE SOUGHT 


Let = be a surface satisfying equation (43). Then equations (I) may be 
replaced by 


Or 


Ou VE Ou VG 
Ov po B, 
1 VE 
= NG B+u + 2m cosh w, 
da AVG «a dVE 
dv Ou’ Ou WG dav’ 
1 avG VG 
(50) =— NE ou a 2m sinh w, 
Ou 
ap -(2 — sinh  — VG x? 
do 10 
du VE Ou 
dw 1A€AVE 8 
dv WG dv V doe 


In consequence of these equations, the expressions for k and / of the surface 


namely 


Ne 

(51) k=2 pre cosh w — VE, 

satisfy the last two of equations (I). 


0 (G5) 
Lav 


Ou 


= sinh w— VG, 


Moreover, it is readily seen that 


), 


0 


av 


1 al 
k ou 


( 


1916] DEFORMABLE TRANSFORMATIONS OF RIBAUCOUR 449 


so that 2; is a surface satisfying (43), as was evident from general considera- 
tions. 
From (26) and (39) we have 


(52) VE = = G+ Mer, 


and from (25) 


Equations (22) reduce to 
6, 
VG’ ov VG ov 
1 avG’ 10, 


VE’ Ou VE Ou 


from which it readily follows that E’ and G’ satisfy (43), 
Equations (21) become 
WG 


Ps V oe 


(54) Pl hoe 


From (2) it follows that if a transformation FR is determined by functions 
d and yw, another R, is given by A + cand yp, where is any constant. De- 


noting the functions for R by X, pe, +*+,0, we have from (I) 
(55) A=A+e, a=a, B=8, m=m, 
and from (6) 
—_ 

Substituting these values in (I), we find that 

k= \(k(A+c) +eVvE), (late) 


If =, denotes the corresponding transform of 2, it follows from (10) that 
=, and 2; correspond with parallelism of tangent planes. 

For any transformation R the tangent planes to S are normal to the lines 
joining corresponding points on 2 and 2. From (5) and (55) it follows 
that S and S, the surface of centers of the transformation R, correspond with 
parallelism of tangent planes. 

When these results are applied to the type of transformations R given 
by (50) we have 
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7. WHEN = IS AN ISOTHERMIC SURFACE 


From (2) it follows that yu satisfies the tangential equation of 2, that is 
the equation satisfied by the direction-cosines of the normal to >. Hence 
if = is any surface having the same spherical representation of its lines of 
curvature as 2, the function uv and the function d, given by equations for = 
analogous to (2), determine a transformation R for =. We say that this 
transformation of = is obtained from the given transformation R of = by a 
transformation of Combescure. From this result it is clear that if we find all 
the transformations R of the type (50) of isothermic surfaces, all the other 
solutions of our problem are obtainable from these by transformations of 
Combescure. Accordingly we confine ourselves hereafter to the case where 
> is an isothermic surface. 

When & is isothermic, the parameters of the lines of curvature may be 
chosen so that 
(56) VE = VG 


Now 
(57) 


From equations (50) we derive the equations 


The integral of these equations is 


_ 


(58) 


where c denotes a constant of integration. 


and the last two of equations (50) reduce to 
Ou doe Ou Ou” 
_ _ log Vio 
dv Ov Ov 
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8. TRANSFORMATIONS D,, OF >. ISOTHERMIC SURFACES PARALLEL TO 2’ 


When ¢ in (58) is zero, equations (50) become 


_ 
ao” B, 
duo” dv 
Ov B, 
0B ag 
Ov Bau? du av’ 
= Feat + m(act — re), 
and from (51) we have 
(60) k= 


These are the equations of transformations D,, of isothermic surfaces into 
isothermic surfaces discovered by Darboux.* 
From (52) and (53) we obtain 


(61) VE =o — = 
and 
(62) k=l =0. 


Hence 2; is a point and not a surface. 

Conversely, if a transformation R can be deformed in such a way that one 
of the sheets of the new envelope is a point, the requirement that the lines of 
curvature on the two sheets of the new envelope shall correspond is satisfied 
identically and consequently the preceding formulas apply. For this case 
it follows from (53) that 

VE = VG = 


and consequently the transformation R is a D,,.¢ 


 *Annales de 1’Ecole Normale Supérieure, ser. 3, vol. 16 (1899), pp. 
491-508. 
Cf. Eisenhart, Rendiconti dei Lincei, 
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For the present case equations (20) become 


9 


—o os 
Ou Ov 0, 


from which it follows that po is constant. Since the functions of a trans- 
formation R are determined only to within a constant factor, it follows from 
§ 2 that the functions by means of which 2’ is transformed into the point 2; 
are given by 


From the general results of this paper it follows that >’ has the same spher- 
ical representation of its lines of curvature as two isothermic surfaces, say 2 
and >’. From (54) it follows that this spherical representation is determined 


by 

— VE’ pe-* VG’ e ou 
64 Vs’ = =— -, Panu 


Applying the results of § 5, we find that the first fundamental coefficients 
of = and >’ are given by 


— 


(65) VE = VG =oe*, VE’ = VG’ =o¢*/¢. 


The principal radii of normal curvature of these surfaces are given by 


(66) 


II 
ll 
| 
1% 


9. TRANSFORMATIONS D,, OF > AND LY’, AND THEIR DEFORMATIONS 


Since = and >’ have the same spherical representation of their lines of 
curvature as >’, a transformation R of each of these surfaces is given by 
taking for u the value of yw’ in (63). From the third and fourth of equations 
(I) we see that the corresponding functions a and £6 differ at most in signs 
from a’ and f’ as given by (63). We consider the two surfaces separately. 

For = we find that the functions are given by 


8 = 
a=", B=-, pak, A=Acte, 
m= 


* Evidently for the surfaces symmetric to = and =’ with respect to a point, the expressions 
analogous to (65) are the same and to (66) differ in sign. 


o 
1 1 
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where c is any constant. Furthermore, 


b= (oS + 


When c = 0, the surface S1 is isothermic and the transformation is a Bias 
The values of the functions for this case are 


a 
(67) a Oo’ = 


From equations analogous to (12) we find for the principal radii of 2, the 
expressions 


( 8) Pil P12 +x 
In a similar manner, transformations of 2’ are given by 
— co 


In particular a transformation D_,, of B’ is given by 


(69) = o’ Vv = o =k, 


and the fundamental functions for the transform Zz have the expressions 


(70) 1 N 1 


The isothermic surface defined by (70) has been considered by Bianchi;* 
he calls the process by which it is obtained from 2 and the functions of a 
transformation D,, of 2 the transformation T;,, determined by the given D,. 
This transformation was defined intrinsically by Bianchi without any indi- 
cation of its geometrical relation to 2. re 

We shall now apply the results of § 8 to these surfaces = and 2’, by de- 
forming the surfaces of center S and S’ of the transformations D_,, of these 
surfaces defined by (67) and (69). 


*Annali di matematica, ser. 3, vol. 12 (1906), pp. 19-54. 
Trans. Am. Math. Soc. 30 


k (¢ l + cé 
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We call (2’) the surface analogous to 2’. From (61) and (64) we find 
that its fundamental functions are given by 


(VE’) = (ce*+re*), ( NG’) = — — de), 


pi pi’ p2 
Hence ( 2’ ) has the same spherical representation of its lines of curvature as 2,. 


Furthermore, from equations analogous to (65) and (66), we find that the 
surfaces (=), analogous to 5, are defined by 


= — 1 1 1 1 
— = 1 1 
(jm) — = —., 
) = (We (=) pi’ (=) p2 


Hence (=) and (2’) are respectively the symmetric of = with respect to a 
point and its Christoffel transform. 

Proceeding in like manner with >’, we get for (2’)’, the surface analogous 
to >’, the functions 


(NEY =~ (act +20), (NPY == — 


(BY (BY 


This surface has the same spherical representation of its lines of curvature 
as X,. The surfaces (=)’ and (3’)’ are respectively the symmetric of 2, 
and the Christoffel transform. 

Let So be the surface of centers of the spheres enveloped by 2’ and 2), 
the latter being a point, say 0. Corresponding normals to 2’, 2, and D4 
are parallel. In like manner, the normals to 2; and >; are ere to the 
lines joining O to corresponding points of So. Hence the permanent conjugate 
system on So, that is the system corresponding to the lines of curvature on 2, 
project upon the unit sphere with center ( 0 into the orthogonal system repre- 
senting the lines of curvature on 2; and 2;. Applying the same reasoning to 
the transformation resulting from the deformation of S, the surface of centers 
of the spheres enveloped by 2’ and 2, we see that the lines joining 0 to points 
of So, the deform of S, are parallel to the normals to 2. Furthermore, it is 
readily seen that S “ So correspond with parallelism of tangent planes; 
and likewise S and So. 


 * The results of this paragraph were obtained by Bianchi by purely analytical processes 
as a result of his intrinsic definition of the transformation T,. Cf. Rendiconti della 
R. Accademia dei Lincei, ser. 3, vol. 24 (1915), ‘ 386. 
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10. CasE c + 0 IN EQUATION (58) 


When we substitute the value (58) of w with c + 0 in equations (50), they 
become 


Or Or 

ON 

=" 

au (Feet ), 

do cero 

da_ 406 0B _ 

Ov Bau? Ou Ov’ 

0B _ _ 9% 

av 
and 


For the surface >’ we have the functions 


73) 
VE’ urAQ+e VG’ pr+e., 
pi p2 po A 
and for 2; 
1 1/A+ 1 1/A+ 
c c 
= P1 +n), p2 +n). 


The last two are a consequence of (73) and of equations analogous to (12). 
From these expressions it is seen that 2; is an isothermic surface. We shall 
now determine its relation to Z. 

It is readily seen that, if \, wu, a, 8, o are solutions of equations (71), a 
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set of solutions of (59) is given by 
(A) =A+e, (a) =a, (8) =8, 

do 


Hence these functions define a transformation (D,,) of 2, and a comparison 
of (70) and (74) reveals the fact that 2, as given by (74) is homothetic to 
the transform of = by the T;,, determined by (D,). 


(m) =m. 


11. WHEN 2; IS A PLANE 


From equations (21), (53), and equations for 2’ analogous to (12), we find 
the following expressions for the principal radii of normal curvature of 2): 


(75) Pil Pi Vice 
Piz 2 V\oe 


Hence the necessary condition that 2; be a plane is 


VE WG 
p2 Vice 


(76) = 
where @ is a function thus defined. This condition is also sufficient; other- 
wise k’, l’, or both, is zero and the lines of curvature on 2, are minimal. 

In consequence of (76) equations (50) become 


an 
VEa, VGB, 
Ou_ 
ao B, 
0a 086 
+ net m( + we), 
(77) 
Ov Ow’ du av’ 
00 Ao _, 
dv * ut), 
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and the last two of equations (50) are satisfied identically. Now 
— 
(78) k= (« 4B), l (« WG). 
Making use of (11) and (12), we get 


(79) 

Hence = and  ; are surfaces with isothermal spherical representation of their 

lines of curvature. Equations (77) define the transformations E,, of surfaces 

with isothermal representation of their lines of curvature into surfaces of the 

same kind, which we have discussed previously from several points of view.* 
From (53) and (54) we get 


VE’ 
(80) é 
Vg’ = vo" 
P2 do” 


8’ and §’ being the coefficients of the spherical representation of 2’. Also 


so that the parametric curves on the plane 2; form an orthogonal system. 
Hence the permanent conjugate system on So, the locus of centers of the spheres 
tangent to >’ and X;, projects into an orthogonal system on the plane 2; .t 


12. IsoTHERMIC SURFACES DETERMINED BY A TRANSFORMATION Em 


From the general theory of §5 we know that there are two isothermic 
surfaces = and >’ with the same spherical representation of their lines of 
curvature as =’. We proceed to their determination. 

From (80) and (49) we find that the first fundamental coefficients of these 
surfaces are 


*Annals of Mathematics, ser. 2, vol. 17 (1915), p. 64. 
7 Cf. Bianchi, Rendiconti dei Lincei, ser. 5, vol. 24 (1915), p. 378. 


6 
W-“- 4, 
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and their principal radii have the values 


Pp, Ac No do’ No? 


From (20), (80), and (78) it follows that we may take p = u/dqo, so that the 
transformation R of 2’ into 2; is given by 


ho’ 


The transformations R of = which are the Combescure transforms of the 
former are defined by 


_ 
ho’ 


Ao’ 
(84) - _ 


where ¢ denotes a constant. From geometrical considerations we know that 
2, is a plane; it is referred to an orthogonal system whose coefficients are 
(85) b= (14%), i= +— (14%). 

When c = 0, the codrdinate system on the plane =; is isothermal. More- 
over, the functions (84) with c = 0 satisfy equations (59) and determine a 
transformation D_,, of 2. 

In like manner we have transformations R of 2’ given by 


a Xo’ B = 
2 
q Ao (c — p) 
an 
— 


The surface 5; is a plane, and for c = 0 we have a transformation D_,, of >’. 

The isothermic surfaces = and 5’ belong to the group discussed in § 8. 
For every transformation E,, has for one of its Combescure transforms a D,,, 
namely the transformation of the minimal surface having the given isothermal 


spherical representation of its lines of curvature. 
PRINCETON UNIVERSITY 
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JACOBI’S CONDITION FOR THE PROBLEM OF LAGRANGE IN THE 
CALCULUS OF VARIATIONS* 


BY 


DAVID M. SMITH 


INTRODUCTION 


The problem of Lagrange, in the calculus of variations, is that of minimizing 
an integral 


with respect to curves which join two fixed points, 1 and 2, and satisfy a 
system of differential equations of the form 


(B = <n). 
For this problem proofs of the necessity of the Euler-Lagrange Rule and 
the conditions of Weierstrass and Legendre are obtainable without the use 
of the second variation.t For the Jacobi condition, however, the situation 
is less satisfactory. Kneser’s proof{ of this condition, which is based on 
the theory of envelopes, excludes important special cases, while the proofs 
that utilize the second variation, though more inclusive,§ involve elaborate 
and complicated transformations. Similar remarks apply to the derivations 
of the Legendre condition by Clebsch|| and von Escherich,§/ both proofs 
being based on transformations of the second variation. Hahn’s deduction** 
of the Legendre condition from that of Weierstrass is more direct. 

The object of this paper is to secure inclusive proofs of the necessity of the 
Legendre and Jacobi conditions by means of the second variation, but with- 
out the use of complicated transformations. If the curves of the family 

* Presented to the Society, April 21, 1916. 

t Bolza, Vorlesungen tiber Variationsrechnung, pp. 558, 603. 

t Bolza, loc. cit., p. 610. 

§ In this connection see remarks by Bolza, loc. cit., p. 634. 

||Journal fiir Mathematik, vol. 55 (1858), p. 254. 

({Sitzungsberichte der Kaiserlichen Akademie der Wissen- 
schaften zu Wien, vol. 107 (1898), p. 1267. 

**Monatshefte fir Mathematik und Physik, vol. 17 (1906), p. 295. 


See also Bolza, loc. cit., p. 607. 
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(3) Yi = €) (i=1,2,+--,n), 


pass through the points 1 and 2, satisfy equations (2), and contain, for the 
parameter value e = 0, the minimizing curve Ey, 


yi = yi(z,0), 
then the function 


must have a minimum for e = 0. The arc Ej. must be an extremal, and 
there exists a set of functions A, (2) (8 = 1,2, ---, m), such that if 


F=ft+ 


then the derivatives J’(0) and J’ (0) are expressible in the forms* 


where the functions 7; (2) are the variations of the family (3), defined by the 
relations 


(5) 


If the are Ey. minimizes the integral (1) the first variation J’ (0) must vanish, 
and the condition 
(6) J”(0)20 


must also be satisfied, at least in the normal case, for every set of functions 
ni (x) which satisfy the equations 


(8) ni(%1) = = 0. 


The condition (6) implies a problem of Lagrange in the xy-space of pre- 
cisely the same type as the original problem in the zy-space. The integral 
to be minimized is J’’(0), and the equations of condition corresponding to 
equations (2) are the equations (7). “To this 2y-problem we apply the Euler- 
Lagrange Rule, the Weierstrass condition, and the corner-point condition,t 
all of which would necessarily be included in a complete discussion of the 


* Bolza, loc. cit., p. 558 and p. 620. 
Tt Bolza, loc. cit., p. 571. 
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Lagrange problem, and which, as was remarked above, may be derived with- 
out the use of the second variation. The application of the Weierstrass con- 
dition to the xy-problem just described furnishes a proof of the Legendre 
condition for the zy-problem which is even simpler than that of Hahn. The 
Euler-Lagrange Rule and the corner-point condition lead to a proof of the 
Jacobi condition which includes the exceptional cases of Kneser’s method, 
and which is much more simple and direct than the proofs which are based 
on the usual elaborate transformations of the second variation. 


1. PRELIMINARY DEFINITIONS AND THEOREMS 


In this paper the following notations will be used: 


ay: = F;, dy; G;, ay; dpi» Oy: al Vai» 
oF F 


Pix, = Ry. 


The various indices a, 7, 7, etc., will always have the ranges: 
a,B =1,2,++:,m; =1,2,-°-,n; 
h=1,2, +--+, 2n; 


and the integral (1) and the equations (2) and (7) will be written in the ab- 
breviated forms 


f da; oe (z,y,y') = 9; =0. 


Concerning the functions f, ¢, , and the arc Ey». the following hypotheses are 
made: 

(A) The are Ey is of class* C” throughout the interval (21, x2), and passes 
through two fixed points 1 and 2. 

(B) The functions f(x, y, y'), dg(%, y, y’) are continuous and possess 
continuous partial derivatives of the first three orders in a region T of points 
(x,y, y'), containing the values (x, y, y’) along Ey. in tts interior. 

(C) The are Ey. minimizes the integral (1) with respect to curves which lie 
entirely within the region T, pass through the points 1 and 2, and satisfy the 
differential equations (2). 

(D) The are Ey. is normal in every sub-interval (&, &) of (a1, #2), 
the only system of m functions Xg (x) of class C’ in (£1, 2), which satisfies the 
equations 


* A curve is said to be of class C‘ if its codrdinates, y; (x), are continuous and possess 
continuous derivatives of the first n orders. 
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is Ag 0 

(E) At every point of the arc Ey. the condition 
is satisfied. 


An admissible curve is defined as a curve of class D’ ,* which lies entirely 
within the region 7’, passes through the points 1 and 2, and satisfies equa- 
tions (2). The variations of a one parameter family (3) are the functions 
ni(x) defined by equations (5). A set of functions »;(2) which satisfy 
equations (7) and (8) is called a set of admissible variations, and it may be 
proved} as a result of the hypotheses just made that the elements 7; (2) of 
every solution of equations (7) and (8) are the variations of a one-parameter 
family (3). 

Let the family (3) be a set of admissible curves containing the minimizing 
arc Ey. for the parameter value e = 0. Substituting the functions y; (2, e) 
in the integral (1) we obtain the function J(e) of equation (4). In terms 
of the notations described above the conditions on the derivatives J’ (0) 
and J” (0) are expressible in the forms 


(9) (0) = Fs + = 0, 


J" (0) -f m Pix + 2ni + nim Ra) de 20, 


and these equations must be satisfied for every set of admissible variations 
ni(x). A consequence of the vanishing of the first variation (9) is the well- 
known Euler-Lagrange Rule: 

If Ey. is a minimizing are there exists a set of multipliers dg (x) such that at 
every point of E42 the equations 


(10) F; = 


are satisfied. 

An arc of class C’’, whose coérdinates y; with m functions d, (x), of class 
C’ , satisfy equations (2) and (10), is called an extremal. Under the hypotheses 
made the following theorems may be proved: 

TueoreM 1. [If 2’ and x” are the abscissas of two arbitrary points P’ and 
P” on the are Ey, then an n parameter family of curves 


* A curve of class D™ is continuous and is composed of a finite number of arcs, each of 
which is of class C™. 

T Bolza, loc. cit., p. 568. 

t Bolza, loc. cit., pp. 604, 565. 
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yi(z, bi, bn) 


can always be found, with the following properties: 
(a) The family contains the arc Ey. for the values (by, «++, bn) = (0,°++,0), 
2, S x S &%, as indicated by the conditions 


yi(z,0,+++,0) (2). 


(b) The functions y;, y:, , y;/Axdb;, are continuous within a certain 
neighborhood of the values (ic, b1, «++, bn) defining the are Ey. 
(c) The curves all pass through the point P’, i. e., 


yi by, +++, bn) = yi (2’). 


(d) The functions y;(x, bi, «++, bn) satisfy equations (2). 
(e) For the values = x'’, (bi, «++, bn) = (0, +++, 0), defining the point 
P” the condition 


is satisfied. 

THEOREM 2. To every normal extremal arc of class C”’ there belongs an unique 
set of multipliers (x). 

With the aid of Theorem 1 the Weierstrass condition* may now be derived. 
Through an arbitrary point 3 on the arc Ey: a curve C is drawn, 

yi = 

whose coérdinates are of class C’ and satisfy 
equations (2). The values y; (23) may be 
assigned arbitrarily, except that the set of 
values x3, yi (23), yi (a3) must lie entirely 
within the region 7 and satisfy condition 
(E). Let4bea point on the curve C whose 
abscissa is =2%3—e, (e >0). By 
Theorem 1 we can draw through a point 0 
on the arc Ey. a curve C of class C’ , 


1 


Yi = hi (z, e) 
which satisfies equations (2), passes through the point 4, 
Yi e) = y %4), 


and reduces to the are Ey, for the value e = 0. 
The one-parameter family V , 


V = Cos + Cas + Eso, 
* Bolza, loc. cit., p. 603. 


| dy: | 0 . 

labs | 
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is a set of admissible curves. Since all the arcs satisfy equations (2), f may be 
replaced by F, and the function J (e) for the family V expressed in the form 


= [Fic + [Fart [Par + Fae, 


where the multipliers \, (2) are those belonging to the arc Ey. Now J (e) 
must be an increasing function for the value e = 0, and consequently the 
condition J’(O0) = 0 must be satisfied. On evaluating J’(O0) with the aid 
of partial integration and equations (10) the following condition, due to 
Weierstrass, is obtained: 

If the are Ey. minimizes the integral (1) the condition 


E(x,y(x),y' (2), p,) F(z, y(x),p,r) F(z, 
— (pi yi) Gi(x, y(z), BO 


must be satisfied at every point P of Ey for all values p; = y; belonging to an 
admissible are through P . 


2. A NECESSARY CONDITION AT A CORNER POINT 


A continuous curve y; = y;(2) is said to have a corner point at x = 2; if 


the condition 
yi — 0) + yi (as + 0) 
is satisfied for at least one value of 7. To derive a necessary condition that a 


curve with a corner point shall minimize the integral (1) suppose given two 
normal extremal arcs of class C”’: 


Eis: yi = yi(x), Ex: yi = (2), 


which join the fixed points 1 and 3, and 3 and 2 respectively. Let the curve 
Ey. composed of the arcs E,; and E32. minimize the integral (1) with respect 
to admissible curves. 

By Theorem 1 an n-parameter family of curves can be found, 


(11) Yi = bi, bn), 


which are of class C’, pass through the point 1, contain the arc Ej, for the 
values (b;, ---, bn) = (0,---,0), satisfy equations (2), and for which 
the condition 


(12) 


is satisfied for the values x = 23, (bi, -~-, bn) = (0, +--+, 0) defining the 
point 3. 


OY: 
——|+ 0 
| 
|| 


1916] A PROBLEM OF LAGRANGE 465 


The parameters b;, may be determined as functions of new parameters e; so 
that the conditions 
(13) Yi (23, bi, bn) = + yi (3) 


are satisfied. For, corresponding to the values (é1, ---, én.) = (0, ---,0) 
there is a solution, b; = 0, of equations (13). Since the condition (12) is 
satisfied for the values x = 23, (bi, ---,b,) = (0, ---,0) it is possible to 
solve equations (13) for the b; and obtain nm functions 


(14) b; = en); 


which are of class C’ within a certain neighborhood |e,| < d and satisfy the 
conditions b;(0, ---,0) = 0. Substitution of the functions (14) in equa- 
tions (11) gives an n-parameter family of curves 

(15) Ys = Yi (2, en), 


which pass through the point 1, 


(16) = Yi 


contain the arc Ey. for the values én) = (0, +++,0), and satisfy 
the conditions 
(17) Yi (Xz, = + Yi (2s). 


In the same manner we construct a second family of curves 
(18) Yi = ln), 

which pass through the point 2, 

(19) Ui €1, = 


contain the arc Ey. for the values ---, = (0, -°--,0), and satisfy 
the conditions 
(20) Yi(%s, €1, = + yi (as). 


The ares (15) and (18) form an n parameter family of admissible curves. 
It is therefore permissible to replace f by F and consider the integral 


J Gm) = [ F(z, (2,0), (2,0), 


1 


+ [oF Ce, y(2,e), 9 (x,e),d)dz, 


in which the functions \,(«) and \,(a) are the multipliers belonging to 
E;3; and E32 respectively. Since J(0,---,0) must be a minimum the 
conditions 
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(21) (na Fe + nin + (a Fs + Win Gs) de = 0 


must be satisfied, where 


Apply now the usual integration by parts and use equations (10). Equations 
(21) then become 


(22) + G; |2 =0. 
But from equations (16) and (19) 

nik(%1) = iz (te) = 0, 
and from equations (17) and (20) 

nik = nik (ts) = dix, 
where 6; is zero for 7 + k and unity fort = &. On substituting these values 
in equations (22) we obtain 


The following necessary condition at a corner point may therefore be stated: 
If the minimizing curve Ey. has a corner point at x = x3 the conditions 


G; (a3 — 0) = G;(23 + 0) (i=1,2,-+-,n) 
must be satisfied. 
3. THE LEGENDRE CONDITION 
If we write 
(23) n’) (ni Nk Put 20: Nk Vix + Ni Nk Rix), 
then the condition 


(24) = [w(2, 9,1 )de =0 


must be satisfied for every set of admissible variations y;(x). The zn- 
problem implied in this condition is of precisely the same type as the original 
problem in zy-space. In the first place it is clear from equations (23) and 
(7) that w and ®, , as functions of x, 7, 7’ , satisfy the continuity conditions (B). 
Furthermore, since 


d d 
rate | bai — 7 (As ve) |, 
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it follows from hypothesis (D) that every extremal of the xn-problem is normal] 
throughout the interval (21, 2). We may therefore apply to the xn-problem 
the theory previously developed. 

By the Euler-Lagrange rule the codrdinates 7; of a curve which minimizes 
the integral J” (O) must satisfy, with m functions ys of class C’ , the equations 


d 
(25) 2, — 79% = 0, 
where 
=o Mg ,. 


Equations (25), when written out, form with equations (7) the system 
d 
>| m Pu t+ m Qk — aa + Ris) | 


d 
(26) opi — 7 vei) =0, 
+ vsini] = 0. 


Equations (261), which are linear and homogeneous in 7;, ug and their deriva- 
tives, are known in the theory of the second variation as the Jacobi equations. 
It will be understood hereafter that the elements u; (2), pg (x) of a solution 
(u; p) of equations (26) are of class C’’ and C’ respectively. 

If now the condition (24) is satisfied the arc in xn-space, 


C: n =0, 


must be a minimizing extremal for the integral J’ (0), since its codrdinates 
ni, With the multipliers ug = 0, satisfy equations (26) and give J’ (0) its 
minimum value zero. The uniqueness of the set of multipliers us = 0 is a 
consequence of hypothesis (D). Since the are C minimizes the integral J” (0) 
in the xn-space the Weierstrass condition for the xn-problem, 


(27) E(x,7,17', =0, 


must be satisfied at every point P of-the arc C for all values ¢: = 7; belonging 
to an admissible arc through P. The condition (27) for the integral (24) 
reduces to the condition that 


(28) Rin =O, 
at every point of C’, and equations (26.) become 
(29) verse = 0, 


where the arguments of Ry, and Wgz are the values x, y;(x), yi (ar), Ag (2) 
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defining the arc Ey. We have therefore the following analogue of the Le- 
gendre condition, due to Clebsch: 

If the are Ey. minimizes the integral (1) the condition (28) must be satisfied 
at every point of Ey2, for all values §; which satisfy equations (29). 


4. THe JACOBI CONDITION 


In deriving the Jacobi condition by the methods of this paper the following 
lemma is useful: 

Lemma 1. Let (u; p) be a solution of equations (26) such that the functions 
Ui, Ui, pg, all vanish for some value £,2, S&S 22. If the determinant 


Wans 0---0 


is different from zero along the are E2, then 


(30) R(z,y,y',r) = 


u(r) =0, pe(t)=0, 


Differentiate equations (26) with respect to z. If R +0 the resulting 
equations may then be solved with (26;) for 7; and wz. -A system is thus 
obtained of the form 
(31) = Ni, = gi(@,n,7', = hg(x,n,7', 4) 
with second members linear and homogeneous in the variables 7;, 7;, ues - 
Every solution of (26) is also a solution of (31). Now it is well known* from 
the theory of differential equations that there is one, and but one, solution 
of equations (31) which assumes for a given value of x a prescribed set of 
initial values. In particular, there is an unique solution which assumes the 
values (0,0,0) fora = &. Since this solution is evidently 


n = 0, = 0, 


the lemma follows at once. 
Assume now that a solution (u ; p) of equations (26) can be found in which 
the functions u; (a) vanish for two values £, 


(32) ui(fi)=O=u(&), 


without all being identically zero throughout the interval (£1, %). The 
curve C in the xn-space defined by the conditions 


n=0, = 0, &, 
ni = Mp=pe(t%), & 


n = 0, Me =0, & S252, 


* Bolza, loc. cit. , p. 171. 
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is an admissible curve for the xy-problem, and has corner points possibly at 
£, and &. If the curve C minimizes the integral J’ (0) the corner point 
condition 


(33) Qy (& — 0) = 2, + 0) 
must be satisfied. Furthermore 
(34) = 0. 
Since the values 7; (&) are all zero, while 
ni(f—0) m(&+0) =0, 
the conditions (33) and (34) may be written 


Rik + Ps Vaile, = 0, 


35 


If now the determinant (30) is different from zero at x = & then the only 
solution of equation (35) is 


(36) = 0, pp(&) =0. 


But if equations (32) and the cornerpoint conditions (36) are satisfied it 
follows from Lemma 1 that 


u;(z) = 0, pe (x) =0, 


which contradicts the hypothesis that the functions u;(2) do not all vanish 
identically. Therefore the corner point conditions (36) are not satisfied, and 
consequently the curve C cannot minimize the integral J” (0). 

Consider next the value of the integral J’’(0) taken over the curve C. 
Since equations (262) are satisfied at every point of C, w may be replaced 
by Q and the integral J” (0) for the xn-curve C expressed in the form 


(37) J” (0) Q(x, u,u’,p)dz. 


1 


Now by Euler’s theorem on homogeneous functions 
20 = (we Me, + + 
where the last sum vanishes, since 
2,, = Bp = 


along C. The integral (37) then becomes 
Trans. Am. Math. Soc. 31 


| 
i] 
| 
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J” (0) (ue + de. 
After integrating by parts, and using equations (25), it follows that 
J”(0) =4 = 0, 
on account of the conditions (32). 

Since the curve C does not minimize the integral J’’(0), and yet gives 
it the value zero, it follows that there must exist an admissible arc in the 
xn-space for which J’ (0) is negative and consequently the condition (24) 
not satisfied. In this event the arc Ey. does not minimize the integral (1). 
We may therefore state the following analogue of the Jacobi condition: 

Consider an arc Ey. that satisfies the conditions (A), . . ., (D) of section 1, and 
along which the determinant (30) is different from zero. Let &, & be any two 
points 

where the values 21, x2 define the end points of the arc Ey. If Ey. minimizes 
the integral (1) there can exist no solution (u;p) of equations (26) with ele- 
ments u; (2x) all vanishing at the end points of the interval ( £, , £2) but not identically 
zero within it. 

Two points £,, £ with which there is associated a solution (uw ; p) of equa- 
tions (26) in which the functions u;(2) vanish at & and £ without all being 
identically zero in (£1, &), are said to be conjugate. If, in particular x; = & 
the Jacobi condition may be stated: 

If the arc Ey, minimizes the integral (1) no point &— conjugate to x; can lie 
within the interval (21, x2). 


5. RELATIONS BETWEEN THE CONJUGATE POINT AND SOLUTIONS OF THE EULER 
EQUATIONS 


In the literature of the calculus of variations the conjugate points are 
usually defined as the zeros of a determinant, the elements of which are ob- 
tained by differentiating the general solution of equations (2) and (10) with 
respect to the constants of integration. This definition furnishes methods of 
great practical value for determining conjugate points, and it is therefore 
desirable to show that these criteria may be obtained from the definition of 
conjugate points given in the preceding section. For this purpose it is con- 
venient to prove the following lemmas. 

Lemma 2. If (2in ; Tan) is a system of 2n linearly independent solutions of 
equations (26) every other solution (u;p) of these equations is expressible in 
the form 


us = Ps = Ch 
A 
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In the first place, the solutions of the system (31) satisfy the Jacobi equa- 
tions (26;) and the conditions 


(38) (dei ni + Vai ni) = Cy. 


By choosing the initial values of a solution of equations (31) properly it is 
always possible to find a solution (z ; r) of the Jacobi equations (26;) in which 
the functions z;(2) satisfy equations (38) for a prescribed set of values Cz. 
Consequently we can adjoin to the 2n solutions (2; ; rg,) of the lemma m 
other solutions (2;, 2n+2; Ts, 2nta) Of the Jacobi equations, in which the func- 
tions 2;, on}, satisfy the conditions 


(39) ( dpi + Wei Zi, 2nta) = 


The 2n + m solutions of equations (31) thus obtained form a fundamental 
system. To prove this, write the product of the determinants |W,.| and D,; 
where 


Wea War dpi 0 

0 0 


In these determinants each element is a matrix whose dimensions are indi- 
cated by the subscripts attached, or else by its position. The ranges of the 
subscripts are those given at the beginning of section 1. The symbol 4;; 
stands as usual for unity or zero according as i = j or i + j. 

Using the rows of |y,.| and the columns of D;, and remembering that 
the elements of the first 2n columns of D, are solutions of equations (26), 
while the elements of the last m columns satisfy equations (39), we obtain 
the following product determinant of order 2n: 


Suppose now that D2 vanishes for some value x3, x; S 73 S22. Then it is 
always possible to find 2n constants ¢, such that 
(40) Ch2mtr, (3) = 0, Cn in (23) = 0, Ch Tan =0. 


Let 
0%; (2) = Lienzn(z), (x) = cn ten (2). 


The functions v; are solutions of equations (262). After substituting and 
putting x = 2; it follows from (40) that 


, 

| Zrh | 
Dz = | 

| Tah | 
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and consequently, since |Wga| + 0, that all the values v, (23) are zero. Since 
the functions »;, v; , 7g, all vanish at 2; it follows from Lemma 1 that 2; and og 
are identically zero. But this contradicts the hypothesis that the 2n solutions 
(zin ; Tan) are linearly independent. Therefore the determinants D2 and D, 
cannot vanish anywhere in the interval (21, x2) , and consequently the 2n + m 
solutions (2;, ; ra, ) of equations (31) form a fundamental system. 

Let (u ; p) be any solution of equations (26). Then it is also a solution of 
equations (31) and is expressible in the form 


(41) Ui = DC, Sing Us = pp = 
Vv 


Multiply equations (41,) and (412) by Wg; and ¢g; respectively and add the 
resulting 2n equations. Since the functions uw; and 2;, are solutions of equa- 
tions (26), while the functions z;, 24, satisfy equations (39), it follows that the 
constants ¢2,4, are all zero. Therefore 


ui = zin, = Tan, 


which proves the lemma. 

Lemma 3. If (in; ran) is a system of 2n solutions of equations (26), and 
if the determinant 
Zin 


A(z, Zin (21) 


does not vanish identically, then the zeros of A(x, x,) determine the conjugate 
points to 1. 

First, let 3 be a point conjugate to 1. Then there is associated with the 
point 3 a solution (u;p) of equations (26) in which the functions wu; (2) 
vanish at 2, and x; without all being identically zero throughout the interval 
(21,23). Since A(z, x,) = 0 the 2n solutions (2; ; rg,) are linearly inde- 
pendent, and consequently by Lemma 2 the functions u;(2) are expressible 
in the form 


u; (a) = Ch 
If now the functions u;(2) all vanish at x; and x3 the conditions 
(42) Zin (21) = 0, Zin (3) = 0, 


must be satisfied. This requires that A(23,2,) = 0. Therefore the abscissa 
x3 of every point 3 conjugate to 1 is a zero of A(z, 2). 

Next, let A(23,2:) = 0. Then it is always possible to determine 2n 
constants ¢c, such that equations (42) are satisfied. If we define 


a 
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= cn Pp = ch Tans 


we have a solution of equations (26) in which the functions u;(2) vanish at 
x, and x3 and yet are not all identically zero in (21, x3) on account of the 
hypothesis that A(z, 21) #0. Therefore every zero x3 of the determinant 
A(x, 2x:) is the abscissa of a point 3 which is conjugate to 1. 

Lemma 4. If (2in; Ten) is a system of n linearly independent solutions of 
equations (26) such that for all values of i and k 


(43) = 0, 


then every other solution (u;p) in which the functions u;(x) all vanish for 
x = 2, 1s expressible in the form 


ui = Ce zie, Ps = 


The proof is quite similar to that of Lemma 2. In the present instance 
we adjoin to the n solutions ; of the lemma solutions (2;, Tp, n+k) 
of equations (26), with the single hypothesis that 


(44) nee (21)|+ 0, 


and in addition the m solutions (2, +3 1s,2n+.2) Of equations (31) which 
were used in the proof of Lemma 2. The 2n + m solutions of equations (31) 
thus obtained form a fundamental system. To show this form the product 
determinant D2 and assume that it vanishes for some value x3, x1; S x3 S 22. 
By precisely the same argument as that used in Lemma 2 it follows that 


(45) zin (2) =0, (2) =0. 


On putting x = 2; in equations (45) and using (43) and (44) it will be seen 
that all the n constants c,,, are zero. Equations (45) then become 


ce zn (z) = 0, ce tax = 0, 


which contradicts the hypothesis that the n solutions (2:4; rg) are linearly 
independent. Therefore the determinants D2 and D, cannot vanish anywhere 
in (21,22), and consequently the 2n + m solutions (2;,;173,) of equations 
(31) form a fundamental system. It follows at once from Lemma 2 that 
every solution (wu ; p) of equations (26) is expressible in the form 


(46) u= Ch 2ihs = 


If the functions u;(2z) all vanish at 2, then 
Ch 2in(%1) = 0, 


and from the relations (43) and (44) it follows that the n constants ¢np,, are 
all zero. Equations (46) then become 
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us = Bik, Pp = 


Lemma 5. If (Zin; 1%) 18 a system of n solutions of equations (26) in which 
the functions 2, (2) all vanish at x,, and if the determinant 


D(x) = | | 


does not vanish identically, then the zeros of D (x) determine the conjugate points 


tol. 
First, let 3 be a point conjugate to 1, and (wu; ) the associated solution. 


By Lemma 4 
u= Ck Zik- 
k 


Since u; (23) = 0 the conditions 


must be satisfied, and therefore D(z;) = 0, that is, the abscissa x3 of every 
point 3 conjugate to 1 is a zero of D(z). Again, suppose that 


D(23) = 0. 
In this event n constants c; can be determined so that 


(2s) = 0. 


If we define 
u; = Pp = Tek, 


we have a solution of equations (26) in which the functions u; (x) vanish at 2; 
and x3; and yet are not all identically zero, on account of the hypothesis 
D(x) #0. Therefore every zero x3; of D(x) is the abscissa of a point 3 
which is conjugate to 1. 

As consequences of the foregoing lemmas we have the customary criteria 
for determining the conjugate points, as follows: 

TuHeoreM. Let 


(47) Yi = Yi(X, C1, Con), Ap = Ag (2, C1, Con) 


be a 2n parameter family of solutions of the Euler equations (10) and the equa- 
tions of condition (2), containing the minimizing arc Ey. and its multipliers for 
the values +++, Con) = (0,°++,0). If the determinant 


dyi (x) | 
Och, 


dy: (#1) 
Och 


A (a, 2%) 
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does not vanish identically along E12, i. e., when the constants c, are all zero, 
then the zeros of A(x, x,) determine the conjugate points to 1 on Ex. 

If the functions (47) are substituted in equations (2) and (10) identities in 
2, C1, ***, Com are Obtained.* Differentiating with respect to any parameter 
c, and then setting (¢1, ---, Cn) = (0,---,0) we obtain the well-known 
result that each of the 2n systems of functions 


dys 


Oc, (h =1,2,-+++,2n), 


in which the constants c;, are all zero, is a solution of equations (26) formed for 
the extremal arc E;2. Since the linear independence of the 2n solutions is a 
consequence of the hypothesis that A(z, 2,) + 0 the theorem follows at once 
from Lemma 3. 

THeorem. Let 


Yi = Yi (2, C1, Cn); Ap = Ag (2%, C1, Cn) 


be an n parameter family of solutions of equations (2) and (10), containing the 
minimizing arc Ey. for the values (c1, = (0, --+,0), and satisfying 
the conditions 

(48) Yi C1, Cn) = Yi (a1). 

If the determinant 


D(z) = 


does not vanish identically along Ey. then the zeros of D(x) determine the conju- 
gate points to 1 on Ex. 
The n systems of functions 
dy; 


k=1 2,°°° n), 


where = (0,--+,0), are solutions of equations (26). The 
linear independence of these solutions follows from the hypothesis that 
D(x) #0. Furthermore, it is clear from equations (48) that 


Oy: (21) _ 
Oc, 


The theorem is an immediate consequence of Lemma 5. 


*In this connection see Bolza, loc. cit., pp. 72, 623. 


ON TRANSCENDENTAL NUMBERS* 
BY 


AUBREY J. KEMPNER 


In 1851, Liouvillet gave the following theorem (the proof of which is very 
simple): 

Let f(z) = 0 be an equation of degree n = 2 with real integral coefficients 
and irreducible in the domain R(1), 2 a real root of this equation, and p/q 
any real rational number. Then a positive number A can be found, which is 
independent of n, such that for all p/q, 

Pls 
q| Ag” 


By applying his theorem to numbers of the form }>2_, (a,/l”), 1 a positive 
integer = 2, a, any integer whose absolute value is limited, Liouville showed 
that all such numbers are transcendental. 

By choosing / = 10, a rule is obtained for representing in decimal-fraction 
form the numbers contained in a certain non-enumerable set of transcendental 
numbers. f 

From Liouville’s method of proof it follows that all numbers }>2.y (a,/I’”), 
when only y, increases “sufficiently rapidly,” are transcendental. 

E. Maillet§$ proves in his book the transcendency of the members of certain 
sets of numbers, the simplest of which are represented by series of the type 


> (an pn”) 


for all rational and even for all algebraic x, and G. Faber|| uses a generaliza- 
tion of Liouville’s theorem and treats a more general type of series which may 


* Presented to the Society, December 30, 1915. 

tJournal de mathématiques pures et appliquées, vol. 16 (1851), 
p. 133. 

t The first proof of the existence of transcendental numbers was also given by Liouville, 
inthe Comptes-rendus, vol. 18 (1844), p. 883, p. 910 (reproduced in f), and was based 
on an investigation of continued fractions. 

§ Introduction a la théorie des nombres transcendants et des propriétés arithmétiques des fonc- 
tions, Paris (1906), p. 105. 

|| Mathematische Annalen, vol. 58 (1904), p. 545. 
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by this method be shown to yield transcendental numbers, namely all series 


of the form 
= hn 2” 


where the hn, Pn, Yn are integers, h, finite (or growing infinite with n in some 


particular fashion), 

=> 1, lim qn = ©; 
and he shows that f (x) is a transcendental number for all algebraic x. These 
seem to be about the only advances along this line.* Series of the type 
> 2.0 (an/a”) can not be treated by these methods. 

The object of the present paper is to prove the following 

THEorREM. Let a be an integer greater than 1, p/q a rational fraction, p=0, 
q>0; an, (n=0,1,2,---), any positive or negative integer smaller in 
absolute value than a fixed arbitrary number M , but only a finite number of the 
an, equal to 0; then ( (p/q)” is always a transcendental number. 

We shall prove the theorem in the form just stated. However, the proof 
still holds, with only slight modifications, for either or both of the following 
extensions: 

1. The exponent 2” may be replaced everywhere by b”, b any fixed positive 
integer greater than 2. 

2. a» need not be limited; for example, the number is transcendental when 
|an|< R*, R an arbitrary fixed number; or a, may be a rational number 
5/e, |5|< R", e < R", but always with the restriction that only a finite 
number of the a, equal to 0. 

Proof. All letters to be used will denote real numbers and all are integers 
except z and The symbol means , means r“? , ete. 

Max. (a1, d2, a3, «++, a@,) denotes the largest of the positive values a1, a2, 
3, 

Let z = )>25(an/a™)-(p/q)", with the restrictions on the an, a, p, 
and q mentioned in the theorem. 

If z is not a transcendental number, there must exist a certain equation 


f(z) = 4,2" = 


where k is a fixed positive integer and the A, are integers, 4, +0. Let N 
be a positive integer, such that N >|A,|, (u =0,1,---,%). We shall 


*E. Borel, Lecons sur la théorie de la croissance, recueillies et redigées par A. Denjoy, Paris 
(1910), Chapter V, and Axel Thue, Journal fiir Mathematik, vol. 135 (1909), 
p. 284, particularly the latter, have generalized Liouville’s theorem in important respects, 
but these generalizations have so far not had any influence on the investigation of trans- 
cendental numbers. 
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substitute z = >>%y(an/a”") - (p/q)" in f(z) and shall show that f(z) + 0 
for any given f(z). 

Once chosen, all of the following numbers are to be considered constant: 
an (n 0,1, a; p; 93k; Ay = 0,1,2, -++,k); M;N. 

Since >>? (an/a”") - x” is a power-series, convergent for all values of z, 
we may substitute z in f(z) and rearrange terms as we like. After substi- 
tuting, we shall arrange the terms according to increasing denominators, 
without canceling anywhere, and collect terms with equal denominators. 

The denominators are formed by taking the product of & or fewer factors 
of the form a” - q’, repetition admitted. 

We prove first the following statement: 

When at the same time n > k and n > where is a certain positive integer 
which will be characterized in the proof, then the three numbers 


satisfy the two conditions: 
(1) < Y2 < 


(2) Ys 


are three consecutive denominators of our fractions in )\*_, A, 2". 

It is clear that all denominators containing any factor a’ , v > n, are larger 
than 3, and of all denominators containing the factor a?", y3 is the smallest. 
Consequently all denominators smaller than 3 contain only factors a” - q’, 
(vy = 0,1,-+--,m—1). Of all denominators smaller than containing not 
more than k of these factors, (n > k), it is obvious that there is none be- 
tween 2 and y; and none between y2 and y3;. It remains however to be shown 
that y2 < 73, in spite of the higher powers of g involved in y2, that is: 


for all n from a certain value on. We have 


which is a fortiori satisfied when 


(qt) < a = 


this is true from a certain integer n = n; on. Thus our statement is proved, 
and we shall henceforth take n greater than Max (k,n). 
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To prove that f(z) + 0, we shall show that f(z) may be written in the 
form 


f(z) 


satisfying the following conditions: 
(a) ¢1, ce are integers, c: S 0, ce + 0; cz a real number & 0; y1, y2 numbers of 
the type defined above, so that 72 is a multiple of 71; yz =1+-71,1>1. 
| co | 1 
b) <—. 
v2 


1 
(c) |es| 


We must admit four possible cases: 
If c, = 0, cs = 0: f(z) + 0 because c. + 0. 


If cy = 0,¢3 + O:f(2) = > + ¢3; + 0 from (a) and (c), without using (0). 


If cy + 0,¢c3; = O: f(z) = 2 +2 + 0 from (a) and (6), without using (c). 


If c; + 0, cs + 0: therefore 
feo] 1 lex] _ 
2 Ti 72 + 
and hence 
C1 Co 
f(z) 


Always assuming that no cancellations have been performed, either in 


orin A, 2", 
our ¢2/72 shall consist of all terms which have exactly the denominator 


n being properly chosen. Evidently c2/y2 arises entirely from the term 
A; 2" of f(z), and we obtain 


| | kiN- M*. |p| 


and ¢c, + 0, provided n > nz, where nz is so large that ap,-; and all following 
coefficients are different from zero. 
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Our ¢:/7: shall comprise all terms with denominators smaller than 72. 
If ce; = 0, ¢;/71 is not needed. If c, + 0, then 


is common denominator of all fractions under consideration, which can there- 
fore be added together, giving ¢:/71. 
To prove |c2|/v2 < 1/71, we show that, from a certain n = nz; on, 


1 


We assume n > Max (k, m, m2). Our inequality reduces to 
k 
kl M* |p| tink) 

which is a fortiori satisfied if 


C,-|p|"? <a”, 
where 


_ ki 


q 


is independent of n; 


which is true from a certain n3 on. We assume n > Max (k, m1, m2, m3). 
This proves 

Our c; shall consist of all terms with denominators greater than 72, that is, 
of all terms of f(z) not accounted for by ¢;/y1 + ¢2/y2. Hence ¢; is built 
up of fractions of the form 


Ay, * Og, * * p 


where p = k, and 71,72, ---,7, are numbers of the sequence 0,1,2, «++ not 
necessarily different from each other. 

These denominators are all of the form a“ - g*, where the exponents of a 
are formed by taking the sum of k or less numbers of the infinite sequence 
2”, (v =0, 1, 2, -+-), repetition permitted, with the restriction that at 
least one of the k or fewer numbers 2” shall be greater than or equal to 2” , where 
n> Max (k, m, m2, 3). There are, as is easily seen, less than (n + 2)* 
exponents that can be so formed from the first n + 1 numbers 2°, 2', ---, 2”, 
counting exponents separately even when they differ only in the order of their 
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summands 2", 2%, ---, 2%. In the same way it is seen that there are less 
than (n + 3)* exponents formed by taking only numbers of the set 2°, 2',- --, 
2"*1; less than (n + 1 + 2)* by taking only numbers of the set 2°, 2', --- , 2"*# 
forl = 1,2,3, +++. The denominators a™ - ¢* all contain also factors q*. 
Hence there are certainly less than (nm + 2)* fractions with denominators 
smaller than a”, less than (n + 3)* fractions with denominators d, whese 
a” =d <a", because there are altogether less than (n + 3)* fractions 
with denominators less than or equal to a”, and less than (n +1 + 2)* 
fractions with denominators d, where a?" = d < a®". I increase (or at 
least do not decrease) the absolute value of all fractions by replacing g* by 1. 

Those numerators belonging to denominators d, a” S d < a”"", have all 
of their 71, 72, ---, 7, not larger than n, those belonging to denominators d, 
a” =d < a”, have all of their 7 = +1, and those fractions with de- 
nominators d, = d < have allr Sn +l. 

Altogether we find for cs: 


2)k. |p| kn |p| 
k(n+l) 
4 (ntl+2)t ten], 
a 


The convergence of this expression is obvious. Besides taking n greater than 
Max. (k, m1, M2, m3) we now take n so large that the sum in brackets is 


smaller than 
2 (nm + 2)* -|p|™ 


a>” 


This is certainly true when the ratio of two consecutive terms is always smaller 
than 3, and happens, for example, when n > log [k - loga|4p| ], as is easily 
verified. Let ns be an integer satisfying this relation, and take n greater than 
Max. (k, m1, m2, 3,4). Substituting, we have 


2N - M*- (n+2)*-|p|* 


< 
and we shall show that the expression on the right is, for sufficiently large 


values of n, less than 
1 


thus establishing the inequality |c3|< 1/72 and proving our theorem. 
Our inequality for n reduces to 
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which is a fortiori satisfied when 
Cz + (mn +2)* (gt)™ < (a?")™, 
where C, = 2N - M* is independent of n. 


By making n sufficiently large we can satisfy the following three inequalities 
simultaneously: 


<|pt|", |p|"<(gt)™, < (a")*, 


which, combined, prove our inequality for all n greater than a certain integer ng . 
By taking n greater than Max. (k, m2, m3, M4, 5, % ) we meet all restric- 
tions which have been successively imposed on n during the proof. 

The condition that only a finite number of coefficients shall be zero (in order 
to ensure c. + 0), I have not been able to remove. 

By taking p/q = 1, we see that all numbers >>%.» (a,/a”"), (am an integer, 
|an| limited, and from a certain point on |a,|2 1) are transcendental. 

As another special case we mention the function, }>%., 2”", introduced by 
Fredholm* to demonstrate the existence of analytical functions possessing 
certain peculiar properties on their natural boundaries. It follows from our 
theorem that this function has transcendental values for an infinite set of 
real rational values of the argument having the origin as a limiting point. 


*See Mittag-Leffler, Acta Mathematica, vol. 15 (1891), p. 279. 
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THE LINEAR DEPENDENCE OF FUNCTIONS OF SEVERAL VARIABLES, 
AND COMPLETELY INTEGRABLE SYSTEMS OF HOMOGENEOUS 
LINEAR PARTIAL DIFFERENTIAL EQUATIONS* 


BY 


GABRIEL M. GREEN 


INTRODUCTION 


In the case of n functions of a single variable, the vanishing of the wronskian 
is the most familiar criterion for their linear dependence. The wronskian 
also plays an important part in connection with the theory of a single ordinary 
homogeneous linear differential equation of the nth order, even in questions 
which do not concern the linear dependence of solutions of the equation. 
It is the purpose of the present paper to generalize the fundamental facts 
connected with these topics, to the case of functions of several variables. 

The characteristic property of an ordinary homogeneous linear differential 
equation is that any solution is expressible linearly, with constant coef- 
ficients, in terms of a fundamental set of solutions. The natural generalization 
to several independent variables is afforded by the completely integrable 
system of homogeneous linear partial differential equations, in one dependent 
variable, any solution of which is likewise linearly dependent on a finite number 
of solutions of the system. It is such systems which are discussed in the 
latter part of this paper; the subject of the linear dependence of functions of 
several variables is developed in the first part, chiefly with a view to its sub- 
sequent application to completely integrable systems. 

In our discussion of linear dependence, we consider throughout n functions 
Yi» Yn Of p independent variables up. Functions and 
variables may be either real or complex; if any of the variables be complex, 
we suppose the functions to be analytic in those variables. All of the main 
theorems, however, are stated and proved under the supposition that the 
independent variables are real; the modifications—usually simplifications and 
omissions—are easily supplied for cases in which one or more of the inde- 
pendent variables is complex. The n functions, then, of the real variables 

* Presented to the Society in two communications under different titles, October 31, 1914, 


and October 30, 1915. The results concerning the linear dependence of functions of several 
variables were read on the latter date. 
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Ui, U2, ***, Up, exist, together with such of their derivatives as come into 
the discussion, in a p-dimensional region, open or closed, which for simplicity 
may be taken to be a p-dimensional parallelopiped. This region we denote 
throughout by A. We shall suppose that all partial differentiations taken 
with respect to more than one of the independent variables are independent 
of the order in which the differentiations are effected. Thus, by the existence 
of 0? y/dudv we imply the existence of 0? y/dvdu and the identity of these 
two derivatives. 

In the case of functions of several variables, there is no one determinant 
which may properly be taken as the generalization of the wronskian. In 
fact, since there is no relation between the number of functions and the number 
of independent variables, no definite form can be prescribed for such a deter- 
minant; and moreover, the difficulty in this direction is increased by the 
fact that we do not assume our functions to be analytic. However, in the 
general criteria which we obtain concerning the linear dependence of n func- 
tions, a matrix takes the place of the wronskian. This matrix contains n 
columns, and each row is formed of the same derivative of the n functions. 
Evidently, the identical vanishing of all the n-rowed determinants of such a 
matrix is a necessary condition for the linear dependence of the n functions. 
The sufficient conditions at which we arrive are expressed in terms of the 
vanishing of the n-rowed determinants of a matrix of the kind just described, 
and are therefore also necessary—a point which we shall not mention in the 
body of the paper. The case of n functions of a single variable is of course 
included in our general theory; the matrix for such functions may have n 
or more rows, and if it has only n the determinant of the matrix is precisely 
the wronskian of the n functions. 

So far as we are aware, the only completely integrable systems which have 
been studied are of two kinds. Of these, the only non-analytic ones are of 
the first order, and are in fact simply systems of total differential equations.* 
For systems consisting of equations of order higher than the first, an analytic 
character has been presupposedf in establishing the existence of solutions. 
By a mere change of notation, however, we cause such a system to take on 
the form of a system of total differential equations, so that the existence of 
solutions may be proved in the non-analytic case for completely integrable 
systems consisting of equations of unrestricted orders. 

Although for a set of functions of several variables there is in general no 
distinctive determinant, formed from partial derivatives of the functions, 


* Cf., for instance, Mayer, Ueber unbeschrankt integrable Systeme von linearen totalen Differ- 
entialgleichungen und die simultane Integration linearer partieller Differentialgleichungen, 
Mathematische Annalen, vol. 5 (1872), pp. 448-470. 

t Cf. C. Riquier, Les syst2mes d’équations aux dérivées partielles, Paris, Gauthier-Villars, 
1910. 
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which might play the réle of a wronskian, we have been able to define a wron- 
skian for a fundamental set of solutions of a completely integrable system. 
By means of this wronskian, if we may so term it, we find certain properties 
of homogeneous linear completely integrable systems quite analogous to 
those of an‘ ordinary differential equation, linear and homogeneous, of the 
nth order. An important result in this connection is a proof that a large 
and important class of completely integrable systems of the kind considered 
may be put into a so-called canonical form by a.certain transformation of the 
dependent variable. The theorem is analogous to the familiar one in the 
case of an ordinary differential equation 


d” y y qd" y 
den + + = 9, 


according to which the equation can be reduced in essentially but one way to 
a unique form for which the coefficient p; is zero, by a transformation of the 
dependent variable alone of the form 7 = A(2)y. 

The reduction, just referred to, of a completely integrable system to a 
canonical form, is very useful in certain geometric questions. Completely 
integrable systems have been important in differential geometry, but not 
until recently has systematic use been made of them in connection with that 
branch of the subject for which they are most peculiarly fitted, viz., projective 
differential geometry. We owe to Wilczynski a very general method for 
dealing with questions in that field; in fact, according to this method, the 
projective differential geometry of a configuration is equivalent to the theory 
of the invariants and covariants, under certain continuous groups, of a com- 
pletely integrable system, the configuration being defined by a fundamental 
set of solutions of the system. The continuous groups which give rise to the 
invariants and covariants are generally broken up into two parts: a trans- 
formation of the independent variables, and a transformation of the de- 
pendent variable. The coefficients of the canonical form are invariant under 
this latter transformation, and in terms of these coefficients and their deriva- 
tives may be expressed all quantities having the same invariantive property. 
It is therefore highly desirable to have a method for computing these semin- 
variants, as they are called; we show that this can be done in a purely mechanical 
way for certain completely integrable systems. In the various particular prob- 
lems studied by Wilczynski and others, this method is the one which has 
always been followed. 

As an illustration, we show in § 7 how the foregoing may be applied to the 
theory of curvilinear coérdinates in space of n dimensions. 


Trans. Am. Math. Soc. 32 
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1. THE FUNDAMENTAL THEOREM 


Before considering the linear dependence of n functions, it will be con- 
venient to prove the following theorem for two functions: 

TueoreM I. Let y; and yz be functions of the p independent variables u; , uz, 

*, Up for which all partial derivatives of the first order, Oy:/Ouz,, Ay2/dux, 
(k = 1,2, +--+, p) exist throughout the region A. Suppose, further, that one 
of the functions, say yi, vanishes at no point of A. Then if all the two-rowed 
determinants in the matrix 


Y2 
dy: 
Ou, 
Oye 


Ou,’ 


dy: 
OU,’ OUy | 


vanish identically in A, y; and ye are linearly dependent in A, and in fact 


Y2 = cy. 
For, if 
’ 
we have on dividing by yj, 
(#) =0 (k =1,2,++,p), 
Our \ 
whence 
Y2 
=C, = Cy; 
Y1 


where ¢ is a constant. 

We might of course have made the statement of the hypothesis somewhat 
weaker, at the sacrifice of smoothness of statement, since all the two-rowed 
determinants of the matrix vanish if all those vanish in which the first row of 
the matrix constitutes a row. 

We note at once that in the case of functions of more than one variable 
there does not exist a single determinant which may properly be called the 
wronskian. We might of course speak of the two-rowed determinants of the 
above matrix as wronskians, but in the case of more than two dependent 
variables, which we shall now consider, new difficulties arise. First of all, 
there is no connection between the number of functions and the number of 
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independent variables. Moreover, the various partial derivatives of the 
functions may exist in a very unsymmetrie way, that is, a function may for 
instance be analytic in some of the variables, and may have only a few deriva- 
tives in the others. Consequently, there is not even a set of determinants, 
the structure of which we may describe once for all, which might take the 
place of the single wronskian in the case of functions of one variable. As a 
matter of fact, we shall see presently that very useful sufficient conditions 
for linear dependence may be stated, in which the partial derivatives with 
respect to some of the variables need exist only to the first order, while those 
with respect to other variables may be of higher orders. 

We proceed to the fundamental theorem, of which Theorem I is a special 
case. Let y1, y2, -**, Yn be functions of the p independent variables uw, we, 

-, Up. Let us denote by y/, y®, etc., partial derivatives of y;, of any 
order or kind whatever. As a matter of fact, it will not be necessary to 
specify just what derivative of y; is denoted by the symbol y’. If each one 
of the set of functions y; (¢ = 1, 2, -++, m) possesses a certain partial deriva- 
tive, we shall say that the set of functions possesses that derivative. Thus, if 
0? y;/Ou, Ouz, (1 = 1,2, +--+, exist, the set of functions possesses the 
derivative 0? y/du; due. 

We shall frequently have occasion to consider matrices of the form 


Y1; 
(1 1) 1 


| (2) (2) 2 


2, y? 


in which the first row consists of the functions yi, y2, «++, yr, and the other s 
rows of derivatives of these functions. We shall denote throughout the 
matrix just written by the symbol 


Me (Y15 


We may now state the fundamental theorem: 

THEOREM II. Let the set of n functions y1, y2, «++, Yn Of the p independent 
variables uy, U2, +++, Up possess enough partial derivatives, of any orders what- 
ever, to form a matrix 


M= ¥Y2; Yn)> 


of n columns and n — 1 rows, in which at least one of the (n — 1)-rowed deter- 
minants, say 


488 G. M. GREEN: [October 


‘ 
vanishes nowhere in A. Suppose, further, that all of the first derivatives of 


each of the elements of the above matrix M exist, and adjoin to the matrix M such 
of these derivatives as do not already appear in M, to form the new matrix 


which has n columns and at least n rows, so thatq2=n—1. Then 2f all the 
n-rowed determinants of the matrix M’ in which the determinant W, is a first 
minor vanish identically in A , the functions yi. Y2, +++, Yn are linearly dependent 
in A, and in fact 

Yn = C1 Y1 C2 Yo Cn-1 


the c’s being constants. 
Let us denote by W,, We, ---, Wy the (nm — 1)-rowed determinants of 
the matrix M, in such wise that W; is the determinant which does not con- 


tain the ith column of M. If we write y” = y;, we have by the hypothesis 


(1) Wi + + WYP = 


Differentiating each of the first n — 1 of these equations with respect to u,, 
we obtain the equations 


, Oy , , Oy? 


(j =0,1,---,n—2). 


But since all of the first derivatives of the elements of the matrix M occur in 
the matrix M’, the last part of each of the relations (2) vanishes because 
of (1), so that 

Ou, + du, + Ou, 
Multiplying the first of these by the first minor of y” (= y,) in W,, the 
second by the first minor of y\’ in W,, etc., the last by the first minor of 
y'—® in W,,, and adding the results, we find that 


aw aw, 
= 0. 
Uk 


Yo, Yn—1 | 
ay | 
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But this holds for every k; hence by Theorem I, since W,, + 0 anywhere in A, 
we have 

In like manner, 

W2 = Cs 


W,-1 = — Was 
the c’s being constants. Consequently, from the identity 


Win + We yo + wiislee + Wr Yn = 0, 
we have 


Wi(-an — Yn—1 + Yn) = (0. 


Since W,, is nowhere zero in A, it follows that 


Yn = C1 Y1 + C2 + Yn-1y 


whence our theorem.* 

It should be noted that if, as stated in the hypothesis, all the n-rowed 
determinants of M’ of which the determinant W,, is a first minor vanish, then 
all the n-rowed determinants of M’ vanish. This follows from the non-vanishing 
of W,,.t 

As an example of the application of Theorem II, consider the case of four 
functions of two independent variables, say 


yi = fi(u,) (¢=1,2,3,4). 
Then if in the matrix 


Oy2 
Ov’ Ov’ 


at least one of the three-rowed determinants vanishes nowhere in the region 
A, a necessary and sufficient condition for the linear dependence of y:, y2, 
¥3, ys is the identical vanishing in A of all the four-rowed determinants of 


*This proof is a direct generalization of the corresponding one given by Frobenius for 
functions of a single variable, in the case of the wronskian. Cf. Bécher, Certain cases in 
which the vanishing of the wronskian is a sufficient condition for linear dependence, these T rans - 
actions, vol. 2 (1901), pp. 139-149, § 1. 

+ Cf. Bécher, Introduction to Higher Algebra, p. 54, for the general theorem on matrices 
involved here. 


| 
Y35 Ys | 
Ou’ Ou’ Ou’? du 
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the matrix formed by adjoining to the above one all of the derivatives of 
the second order of y;, y2, ¥3, ys. If, however, at least one of the three-rowed 
determinants in the matrix 


Y2>5 ¥3, 
dy, 
Ou ’. Ou? Ou: 
Fy, Hye Hy 
Ou?’ Out’ Ou?’ du? || 


vanishes nowhere in A, then a necessary and sufficient condition for the 
linear dependence of y:, y2, ¥3, ys is the identical vanishing of all the four- 
rowed determinants of the matrix 


Py Py Fy Py | 
Bu’ Ov’ du’ dudv’ 


Ys Fy Hy Hy | 
Gu’ Ov’ du’ dudv’ 


so that the existence of even 0? y;/dv’ is not presupposed. 

In the fundamental theorem concerning the linear dependence of n func- 
tions of a single variable, the identical vanishing of the wronskian is shown 
to be sufficient for linear dependence provided the wronskian of n — 1 of the 
functions is not zero anywhere in the interval of the independent variable 
under consideration. This evidently comes as a special case under Theorem 
II. But Theorem II also affords other sufficient criteria in the case of a 
single independent variable. An example will show the general nature of 
these criteria. Let y,z, w be functions of the real variable z in an interval I. 
Denoting by accents differentiation with respect to x, let us suppose that 
the determinant 

ly, 3 | 


vanishes nowhere in 7. Then a sufficient condition for the linear dependence 
of y, z, w is the identical vanishing in J of the two determinants 


Z, w | Z, 


y 2 
| 
Iv Iv IV 
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However, in any case such as the present one, a very general theorem con- 
cerning linear dependence, due to D. R. Curtiss, is applicable.* 

Before leaving the fundamental theorem, it may be well to consider the 
case of analytic functions. For n analytic functions of a single variable, the 
identical vanishing of the wronskian is a sufficient condition for linear de- 
pendence, and nothing need be demanded as to the non-vanishing of the 
wronskian of » — 1 of the functions. This follows from the fact that for 
analytic functions linear dependence in any subregion implies linear dependence 
throughout their common region of definition. If in Theorem II the func- 
tions involved are analytic, an obvious weakening of the hypotheses may 
be made; for instance, it is sufficient to know that all the n-rowed determinants 
of the matrix M’ which contain the elements of W, vanish throughout an n-dimen- 
sional sub-region of A which contains a single point at which W,, is different 
% from zero. Such would be the most general theorem that may be stated in 
this case; however, by strengthening the hypotheses sufficiently we may 
obtain a theorem in which no demand is made concerning the non-vanishing 
of W,. A very elegant theorem of this kind is the following, which is due 
to O. D. Kellogg:t 

Let y1, Y2, ***» Yn be analytic functions of the p independent variables uy, ue, 

+, Up—real or complex—having a common region of definition A, and let 
Ma(y1; Y2,°**, Yn) be the matrix formed from the functions yo, Yn 
and all their partial derivatives up to and including those of ordern —1. Then 
the functions y1, Y2, *** 5 Yn are linearly dependent in A if and only if all of the 
n-rowed determinants of the matrix M, vanish throughout A. 


2. Two THEOREMS CONCERNING THE LINEAR DEPENDENCE OF ” FUNCTIONS 


In the present section we shall prove two theorems, which are generaliza- 
tions of theorems given by Béchert for functions of a single variable. They 
will be useful later, in connection with completely integrable systems of 
partial differential equations, just as Bécher’s theorems are in the case of 
ordinary linear differential equations, of which completely integrable systems 
are a generalization to several independent variables. 


THEOREM II]. Let yi, yo, «++, Yn be n functions of the variables u;, ue, 
+, Up. Suppose that the set of functions possesses the n — 1 partial deriva- 
tives y?, +++, P, (4 = 1, 2, +++, 80 that the function 


* Cf. Theorem VI, p. 292, of his article, The vanishing of the wronskian and the problem of 
linear dependence, Mathematische Annalen, vol. 65 (1908), pp. 282-298. 

t Nomograms with points in alignment, Zeitschrift fir Mathematik und 
Physik, vol. 63 (1914), pp. 160-161. The statement there given is equivalent to ours, 
if the proper interpretation is made of Prof. Kellogg’s definition of the rank of a matrix. 

t Cf. Theorems V and VI of the paper already cited, these Transactions, vol. 2 
(1901), pp. 139-149. 
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Y= C1 Yi + C2 Y2 +++ Yn; 


where the c’s are constants, has the same derivatives, yP ,y,--+,y" . Sup- 
pose also that no function of the form of y, other than zero, vanishes together with 
its derivatives y , y, «++, y» at any point of A. Then if the determinant 


— 

Ill 


vanishes at any point (a1, d2, +++, dp) of A, the functions y;, y2, Yn are 
linearly dependent. 

For, since W a2, = 0, there exist m constants C2, Cn 
not all zero, such that 


ay? (a, + (a, dp) + + tn dp) = 0 
(j =0, 


so that the function y = ¢; y1 + C2 ye + +++ +n Yn Vanishes together with 
its derivatives y, y?, ---, y™ at a point of A, and must therefore be 
identically zero. 

TueoreM IV. Let y:, y2, +++, Yn be a set of n functions of the p variables 
Uy, U2, ***, Up which throughout A possesses r partial derivatives (r > n — 1), 


Suppose that from among these derivatives may be selected at least n — 2, say 
(m= n — 2), such that all of their first derivatives occur 


among the set ¥?, and are bounded in A. Let 


M= Mn Un) (m2=n-2), 
and suppose that 
M'= Un) (q>m) 


is the matrix formed from M by adjoining to the elements of M those of their 
first derivatives which do not already appearin M. Assume also that of for any v 
columns of M (v = n) all the v-rowed determinants containing elements of the 
first row of M vanish identically, then all the v-rowed determinants formed from 
the same v columns of M’, and containing elements of the first row of M , and of 
v — 2 other rows of M, vanish identically.* Suppose, further, that no function, 
other than zero, of the form 


* Some remarks concerning this part of the hypothesis will be found after the proof of the 
theorem. 


(1) (1) (1) 

Yi; Yn 
(Ct ‘ 

| (n—1) n—1) (n—1) 
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Y = C1 Y1 + C2 Y2 +++ +OnYn, 


where the c’s are constants, vanishes together with its derivatives , y, 
at any point of A. Then the identical vanishing of all those n-rowed determinants 
of M’ in which appear elements of the first row and n — 2 other rows of M is a 
sufficient condition that the functions y1, y2, «++, Yn be linearly dependent. 

Suppose first that one of the (m — 1)-rowed determinants of the matrix M 
containing elements of the first row of M does not vanish identically. We 
may without loss of generality take for this determinant the one which appears 
in the upper left-hand corner of M. As in § 1, we denote this determinant 
by W,. Then at some point (a1, a2, ---, ap) of A, W, is different from 
zero. But W, is continuous in A, because all its partial derivatives of the 
first order exist and are bounded in A*; consequently, it differs from zero in 
some neighborhood of the point (a1, a2, ---,@,). By Theorem II, then, the 
functions y1, yz, ***, Yn are linearly dependent in this neighborhood, i. e., 
there exist constants ¢;, C2, --+, ¢n, not all zero, such that the function 


Y =O Yi + +onYn 


is zero in the said neighborhood of (a;, a2, ---, a>). But then this func- 
tion vanishes at (a1, d2, «++, @») together with its derivatives y™ , y®, ---, 
y™, and therefore, by hypothesis, vanishes identically. Consequently the 
functions y1, Y2, ***, Yn are linearly dependent in A. 

We now assume the other alternative, viz., that all of the (n — 1)-rowed 
determinants of M which contain elements of the first row of M vanish iden- 
tically; in particular, those formed from the first m — 1 columns of M do. 
We recall that, by hypothesis, if all the v-rowed determinants formed from 
the first vy columns of M and containing elements of the first row of M vanish 
identically, then the determinants formed from the first v columns of M’, 
. and containing elements of the first row and of vy — 2 other rows of M, vanish 
identically. We consider then in succession the first n — 1 columns of M, 
the first n — 2 columns, the first n — 3 columns, etc. Suppose that for the 
first n — ~ columns of M all of the (nm — 7)-rowed determinants which con- 
tain elements of the first row of M vanish identically. We know that this 
happens for i = 0 andi = 1. If it happens for i = n — 1, we shall have 
yi = 0, and then y1, y2, «++, Yn would be linearly dependent. Suppose then 
that i <m-—1. Then there must exist a particulari, 1 =n — 2, such 
that all the (m — 7)-rowed determinants formed from the first n — 7 columns 
of M and containing elements of the first row of M vanish identically, whereas 


*If for a function of p variables all the partial derivatives of the first order exist and are 
bounded in the p-dimensional region A, the function is continuous in A with respect to all 
the variables taken together. Cf. Osgood, Lehrbuch der Funktionentheorie, vol. I, second 
edition (1912), page 56. 
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the (n — i — 1)-rowed determinants formed from the first n — 7 — 1 columns 
of M and containing elements of the first row of M do not all vanish identically. 
But the identical vanishing of all the (nm — 7)-rowed determinants of the 
kind considered, formed from the first n — 7 columns of M, carries with it, 
by hypothesis, the identical vanishing of all the (m — i)-rowed determinants 
formed from the first n — i columns of M’ and containing elements of the 
first row and of n — i — 2 other rows of M. We may therefore apply the 
special case of the present theorem which we have already proved, to deduce 
the linear dependence of y:, y2, ***, Yn-i- Consequently, the functions 
Yi, Y2, ***, Yn are also linearly dependent, and our theorem is proved. 

It is to be noted that in the hypothesis of Theorem IV, a restriction has 
been placed upon the nature of the matrices M and M’ which we did not 
find it necessary to impose in the previous general theorems. As may readily 
be seen by an example, it is only in special cases that the identical vanishing 
of all the v-rowed determinants formed from v columns of the matrix M and 
containing elements of the first row of M implies the identical vanishing of 
all the v-rowed determinants formed from the corresponding v columns of M’ 
and containing elements of the first row and of vy — 2 additional rows of M. 
Probably the most important cases of this type arise when the functions 
Yi, Y2, ***,s Yn are solutions of the same completely integrable system of 
partial differential equations. We shall have occasion, in a subsequent sec- 
tion, to apply Theorem IV in just such a case. 


3. COMPLETELY INTEGRABLE SYSTEMS OF HOMOGENEOUS LINEAR PARTIAL 
DIFFERENTIAL EQUATIONS 


A system of partial differential equations in one dependent variable is said 
to be completely integrable if there exists one and only one integral of the 
system satisfying arbitrarily chosen initial conditions involving a finite number 
of arbitrary constants and arbitrary functions. The completely integrable 
systems which we shall consider are such that a general solution is expressible 
linearly, with constant coefficients, in terms of a fundamental system of solu- 
tions. Thus, these completely integrable systems, which we shall suppose 
to contain a single dependent variable and any number of independent vari- 
ables, form a natural generalization of the ordinary linear homogeneous differ- 
ential equation. 

So far as we are aware, the only completely integrable systems containing 
partial differential equations of order higher than the first which have hitherto 
been studied have analytic coefficients and solutions.* From the function- 
theoretic point of view these are of course the most important completely 


* Cf., for example, C. Riquier, Les systemes d’équations aux dérivées partielles, Paris, Gauthier- 
Villars, 1910. 
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integrable systems. They have found a place not only in analysis, but in 
geometry as well. The Gauss-Codazzi equations which form the basis for 
the metric differential geometry of surfaces constitute a completely integrable 
system, and in a later section we shall see how important are such systems in 
projective differential geometry. 

The simplicity of the analytic case, if we disregard questions of convergence, 
is apparent. Let us consider, as always, a completely integrable system of 
linear homogeneous partial differential equations containing a single de- 
pendent variable; then by successive differentiation of the given equations, 
all the derivatives of the dependent variable are expressible uniquely as 
linear combinations of a finite number nm among them, so that by well-known 
methods it is seen that the general solution contains exactly the same number 
n of arbitrary constants, which enter linearly. 

We shall confine our attention throughout to linear homogeneous com- 
pletely integrable systems in one dependent variable, though most of what 
we shall have to say may readily be extended to the case of several dependent 
variables. We shall not, however, suppose the coefficients in our equations 
to be analytic.* 

Let us first construct a completely integrable system which shall have as a 
fundamental system of solutions an assigned set of functions. Suppose that 
the n functions y1, y2, Yn of the p independent variables u2, Up 
possess enough partial derivatives, of any kind or order whatever, to form a 
determinant of the nth order, 


y2; 


which vanishes nowhere in a region A, and in which, as heretofore, we have 
denoted derivatives of a function y by y™, y™, ete., the indices being no 
indication of the kind or order of the derivative. Suppose also that all of 
the derivatives of the first order exist for all of the elements in the determinant 


* Completely integrable systems in which the coefficients are not restricted to being analytic 
were curiously enough the first: to be studied, but only such systems as consist of equations 
of the first order. These systems are really systems of total differential equations, and the 
classic work of Mayer in connection therewith is well known. See his memoir, Ueber un- 
beschrénkt integrable Systeme von linearen totalen Differentialgleichungen und die simultane 
Integration linearer partieller Differentialgleichungen, Mathematische Annalen, 
vol. 5 (1872), pp. 448-470. . 


| | 
| | 
| 
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W. Then we may set up as follows a system of partial differential equations, 
of which each of the functions y;, y2, «++, Yn is a solution, and such that any 
solution whatever, say 7, is of the form 


Y = C1 + C2Y2 Yn, 


where the c’s are constants. Let us write down the system of np equations 


Oy 
dus = as k) y k) aS k) eee + a®®) 1) 
dy 1, 1,k a, 
dy” 
(n—1, (n—1, k) 1,k 1, 1 


Ck =1,2, Pp). 


We may so determine the coefficients a{’” of this system that the functions 


Yi» Y2, ***» Yn Will be solutions of the system. If in any one of the equations 
these functions be substituted successively, n linear equations are obtained 
which may be solved for the coefficients a of the first mentioned equation 
since the determinant W is different from zero. If the coefficients a are deter- 
mined in this way, the system of differential equations will surely have as 
solutions each of the functions y;, yz, «++, yn. In fact, the system of np 
differential equations may be written in the compact form 


(5) | |=0 
| Ou,’ Ou,’ Ouy | 


(7 =1,2, -+-,p), 


which makes the statement evident. Moreover, it is also evident that any 
expression of the form 


Y= C1 yi 
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will also satisfy the system. Lastly, any solution of the system must be of 
this form. For, let 7 be any solution of the system, and consider the two 
matrices 


(6) 


M = Mii(m, y); 
M’ = M,(n, 


where M’ differs from M by the additional rows consisting of all of the partial 
derivatives of the first order of the elements of M which do not already appear 
in M. The derivatives appearing in these additional rows are precisely those 
which appear in the left-hand members of the system of differential equations 
(4), so that each of the additional rows is a linear combination of the n rows 
of the matrix M. Consequently, all of the (m + 1)-rowed determinants of 
M’ which contain the elements of M vanish identically. But one of the n- 
rowed determinants of M, namely W, vanishes nowhere in A, so that by 
Theorem II the n + 1 functions y;, y2, «++, Yn, 9 are linearly dependent, 
and in fact 
Y = C1Y1 +2 Y2 Yn- 


We may state our result as follows: 

TuHeoreM V. [f the system of partial differential equations (4) has n solutions 
Y1, Y2, ***, Yn for which the determinant (3) vanishes nowhere in A, then any 
solution of the system will be linearly dependent on these n solutions, and any 
function linearly dependent on these n solutions will also be a solution. 

We have therefore obtained a completely integrable system of which the 
given functions y1, y2, -**, Yn constitute a fundamental system of solutions. 
The form in which we have chosen to write the system of differential equa- 
tions is not generally the simplest for any given particular case; in fact, in 
many cases most of the equations will be found to be merely identities, and 
others of them obtainable from the rest by mere differentiation. The com- 
pletely integrable system will in such a case reduce to fewer than np equations. 
An example will make the point clearer. Suppose that the four functions 


Yr = Ye (U1, U2) (k =1, 2,3, 4) 
are such that the determinant 
Y1 Y25 ¥3 Y4 
dys dys dys 
Ou,’ Ou,’ Ou,’ Ou, 
W=| dy Ys dys 
Py Py 


} 
i 
| 


498 G. M. GREEN: [October 


is zero at no point of the region A. Then the completely integrable system 
of which the four functions y;, y2, ys, ys form a fundamental system of solu- 
tions will be 


Ou, Ou,’ Ou; OU. 
+50 


we have of course presupposed the existence of all the derivatives which we 
have written down. The first three of the above equations are identities, and 
as a matter of fact the third of them ought to occur twice. The completely 
integrable system therefore consists essentially of four instead of 2 x 4 = 8 
equations. Moreover, if dd’ — 1 is nowhere zero in A, the last two of the 
differential equations may be obtained from the two immediately preceding 
them by differentiation and algebraic processes. The completely integrable 
system of eight equations in this case reduces therefore to the two equations 


y ay 

our ay + =+ 
, Py 


It is evident that in any particular problem this form of the system is most 
useful; however, in the very general questions which we shall treat subse- 
quently, the form (4) in which we first wrote the system seems of great assist- 
ance, in spite of the fact that many of the equations may be superfluous. 
From what we have already seen, the analogy of homogeneous linear com- 
pletely integrable systems to ordinary homogeneous linear differential equa- 
tions in very close. In later sections the similarity will be made to appear 
even more striking; at present we take occasion to make the following remark. 
If in system (4) we keep & fixed, we have a system of n differential equations 
which may be regarded as a system of ordinary differential equations in which 
the independent variable is u,; and the dependent variables are m in number, 
viz., y, y, y, «++, yY. The completely integrable system (4) may there- 
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fore be regarded in p ways as a system of n ordinary differential equations in n 
dependent variables, containing p — 1 parameters. 


4. THE INTEGRABILITY CONDITIONS AND WRONSKIAN OF A COMPLETELY INTE- 
GRABLE SYSTEM 


In the last section, the completely integrable system (4) was set up on the 
supposition that a fundamental system of solutions was known. Naturally, 
if a system of this form, which we shall write 
ay 


Ou, Daf? (j =1,2, 


(7) 


is to be completely integrable and have precisely n fundamental solutions,- 
the coefficients cannot be arbitrary. We shall in the present section investi- 
gate certain relations which exist among the coefficients of system (7), under 
the supposition that the system has a fundamental system of n solutions 
Yi, Y2» ***» Yn in the region A. 

Let us call the quantities y, y , y®, ---, y“» which occur in the right- 
hand members of equations (7), the primary derivatives of the function y. 

The determinant W given by equation (3) is formed of the primary deriva- 
tives of a fundamental system of solutions of the differential equations. We 
shall call this determinant the wronskian of the system of solutions. In 
setting up the completely integrable system (4) we assumed that the wronskian 
of the fundamental system of solutions vanished nowhere in 4; this is equiva- 
lent to the assumption that if for every solution of the system of differential — 
equations a relation exists among the primary derivatives, of the form 


(8) ao y +ary™ + + =0, 


then all of the functions ao, a1, ++ , @n—1 must be identically zero. 

We shall from now on assume that in the region A the coefficients a” of 
system (7) are bounded and possess all partial derivatives of the first order. 
Then since by supposition there exists a function y not identically zero which 
satisfies equations (7), we may differentiate these equations and obtain in 
two ways an expression for 0? y“” /du;, du, (k +1). In fact, 


y + 


Ou, Ou ~ \ Our Ou; 
Ou, i=0 + Our F 


In the right-hand members of each of these, the derivatives of y may be 
replaced by their expressions in terms of y, y™, ---, y~» taken from sys- 
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tem (7). In this way two different expressions of the form (8) are obtained 
for a derivative 0? y /du, du, (k #1). But these two expressions are to 
be identically equal, so that by our assumption the coefficients of y in the two 
expressions must be identically equal. We may therefore state the theorem: 
TuHeoreM VI. A necessary condition that system (7) have a fundamental 
system of n solutions is that the coefficients satisfy identically the relations 


da” n—1 dah” n—1 
(9) + ah? — + 
i=0 Uk i=0 


(v,j =0,1, 


We shall speak of equations (9) as the conditions of complete integrability, 
or as the integrability conditions, of system (7). 

From the set of integrability conditions (9) let us pick out n equations, 
by keeping k and / fixed and choosing vy = 7 = 0,1, ---,n—1. The equa- 
tions are 


da? k) n—1 da b n—1 
l i=0 k i=0 


Adding them, we find without difficulty that all terms drop out except the 
first derivatives, and in fact 


= (k,l=1,2,---,p), 


fj Ou 
that is, 


Consequently, the n quantities }>7=) a” are the first derivatives of some 


function f (uw, U2, «++, Up), and we may write 
(11) ——= diay” (k =1,2,---,p). 


We shall presently find relations (10) and (11) very useful. 
Let us now form the wronskian (3) of any fundamental system of n solu- 
tions ¥1, Y2, ***, Yn of system (7): 


Y1> Y2; 

1) (1) ql) 

(n—1) n—1) (n—1) 

y; » Un 


We differentiate W with respect to u,; if the result be written as the sum of 
the n determinants formed from W by replacing a row thereof by the deriva- 


| 
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tives of that row with respect to u,, and if use be then made of equations (7) 
to replace any derivative by an expression linear in y;, y{”, 
y!—, it will be found without difficulty that 


ow (7,4) Of 


Consequently, 
W = const. 


We may therefore state the theorem, which is a direct generalization of the 
familiar theorem of Abel in the theory of an ordinary homogeneous linear 
differential equation of the nth order: 

TuHeoreM VII. The wronskian W of a fundamental system of n solutions 
of the completely integrable system (7) may be determined by a quadrature from 
the coefficients of the system, and is given by the expression 


(12) W = const. e’, 


where f may be found by a quadrature in virtue of equations (10) or (11). 

As a corollary we have the theorem: 

TueEoreM VIII. If the wronskian vanish at any point of the region A, it 
vanishes identically in A .* 

We remark in passing that equations (10) and (11) might have been deduced 
directly from a consideration of the wronskian. For, equations (10) may be 
deduced by differentiation of the relations 


ow xt 
(5, k=1,2,-++,p). 
(13) W ( 
But the relations (13) are of interest in themselves, because they show that 
the function >>7=} a” is the derivative, with respect to u;, of a function f, 
for k = 1,2,---,p, and this without necessitating the assumption of the 
existence of any of the first derivatives of the coefficients a” of system (7). 


5. THE NORMAL FORM OF THE COMPLETELY INTEGRABLE SYSTEM 


A necessary condition that a system of partial differential equations of 
the form (7) have a solution not identically zero is that the coefficients of the 
system satisfy the integrability conditions (9). Whether the condition is 
also sufficient remains to be seen; as a matter of fact, however, an example 
will show that a system of form (7), whose coefficients satisfy the integrability 
conditions, need not have a solution which is not identically zero. For, if 
we select as primary derivatives y and 0? y/du”, and suppose the independent 
variables to be wu and v, we may write down the system of form (7): 


 * We denote by A the region in which all of the solutions yi, y2, --*, Yn exist. 
Trans. Am. Math. Soc. 33 
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Oy _ ry ys, Py 
ey ey ey 


which has no solution different from zero even if the integrability conditions 
of form (9) are satisfied for the system, unless of course other relations exist 
among the coefficients. In fact, differentiation of the first equation of the 
system twice with respect to w will lead to two new equations which are in 
general incompatible with the first and third equations of the system. In 
our example this results from the peculiar choice of primary derivatives, a 
gap occurring between y and 0? y/du’. 

We shall presently show that it is always possible, for a system of form (7) 
which is known to have n fundamental solutions, to make a particular choice 
of primary derivatives which removes the above difficulty, although at a 
slight sacrifice in generality. Let us first define what we shall call a chain of 
derivatives and a normal set of derivatives. A chain of derivatives of a func- 
tion y is a set of derivatives y, y , y, ---, y, such that y is of the ith 
order and may be obtained by a differentiation from y“. Thus, denoting: 
partial differentiation by subscripts, we see that the quantities y, yu, Yu, 
Yuu» form a chain, while y, yu, Yur» Yuuu do not because yuu. cannot be formed 
from yu» by differentiation. 

A normal set of derivatives is a set such that each element of the set belongs 
to at least one chain which can be formed from elements of the set. The 
normal set need not consist of a single chain, and a given member of the set 
may belong to more than one chain. 

We shall now show that if system (7) has exactly n fundamental solutions 
Yi,» Yo, ***, Yn, the primary derivatives may under certain conditions be 
so chosen as to form a normal set. However, in order not to interrupt our 
reasoning later, we shall first prove a lemma which is to be used in the course 
of the discussion. 

Lemma. Suppose that system (7) has n solutions y;, ye, «++, Yn whose wron- 
skian W vanishes nowhere in the region A. Then y:, y2, +++, Yn cannot all be 
solutions of a system R of form (7) having less than n primary derivatives, these 
primary derivatives being selected from among those of system (7). 

Let y, y, y, «++, y? (r <n —1) be the primary derivatives of the 
system R. Then the lemma follows at once if in the matrix 


M —= Yn) 


at least one of the (r + 1)-rowed determinants vanishes at no point of A, 
for then this determinant plays the réle of a wronskian for system R. Sup- 
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pose, however, that each of the (r + 1)-rowed determinants vanishes some- 
where in A. They can not all be identically zero in A, because then the 
wronskian of y1, y2, ***, Yn would be zero also, contrary to hypothesis. 
Consequently one of these determinants, say that formed from the first r + 1 
columns of M, is different from zero at some point of A, viz., (a1, «++, Gp), 
and therefore, since it is continuous, it differs from zero in a p-dimensional 
region A’ containing this point. Then in this region A’, every solution of 
system R is linearly dependent on y:, y2, «++, Yr+1, 80 that in A’ every solu- 
tion of (7), being also a solution of R, will be linearly dependent on y:, y2, 
*++, Yr41. Then in particular we shall have in A’ 


= C1 Y1 + Co Yo + 


the c’s being constants not all zero. Therefore the function 


will vanish identically in A’, and so will its derivatives y™, y, ---, y™. 
But it is easily seen that if system (7) has solutions y;, y2, --*, Yn for which 
the wronskian vanishes nowhere in A, then any solution all of whose primary 
derivatives vanish at some point of A will vanish identically in A. Conse- 
quently the function y considered above vanishes identically not only in A’, 
but throughout A. Therefore we conclude that system (7) would have only 
r + 1 linearly independent solutions, which contradicts the hypothesis. 

We now proceed to show how system (7) may be reduced to a similar 
system for which the primary derivatives form a normal set, under the hypo- 
thesis that system (7) has exactly n fundamental solutions y;, yo, «++, Yn 
whose wronskian W does not vanish at any point of a region A of p dimensions. 
We assume, then, that the n primary derivatives of system (7) do not form a 
normal set. Then we may select from among these primary derivatives a 
normal set which consists of at least one element, namely y; let this normal 
set be y, yY, y, ---, y, (r<m—1). Suppose, furthermore, that it is 
the largest normal set which may be formed from among the primary deriva- 
tives. Then of the first derivatives of y, y, y, ---, y™ none is a primary 
derivative of system (7), for otherwise our normal set could be enlarged.* 
Now, all of the first derivatives just mentioned are expressible linearly in 
terms of the primary derivatives of system (7), the coefficients in the expres- 
sions being continuous in A. This gives us p(r + 1) equations, in the right- 
hand members of which appear some or all of the quantities y, y , y, ---, 


* Lest a misunderstanding occur here, we point out that if y and y™ are primary deriva- 
tives, and if dy/du; = y™, then this equation is one of the equations of system (7); also, 
if y be one of the normal set y, y™, ---, y™ under consideration, then y™ belongs to the 
said normal set also. 
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y®. But in the right-hand member of at least one of these equations must 
appear some one of the primary derivatives, say y“, which is not one of 
our normal set, for otherwise the p(r +1) equations would constitute a 
system of form (7) whose primary derivatives are y, y, y, ---,y™,i.e., 
the said p(r-+ 1) equations would constitute a system R such as is con- 
sidered in the lemma proved above, and the original system (7) could then 
have no more than r + 1 linearly independent solutions, contrary to hypothesis. 
Consequently, of the first derivatives of y, y®, y, ---, y™, at least one 
has a linear expression, in terms of the primary derivatives, which contains a 
primary derivative y“ which is not a member of the normal set y, y® , y®, 
, y. In other words, the coefficient of y“ in this expression is not 
identically zero; since this coefficient is continuous, there must exist a p- 
dimensional sub-region of A, say A1, at no point of which it vanishes. Con- 
sequently, in the region A; we may express the primary derivative y linearly 
in terms of all the other primary derivatives, and the additional quantity 
which occurs in the left-hand member of the equation from which we have 
just found the expression for y“. This additional quantity, we recall, is 
one of the first derivatives of one of the elements of the normal set y, y™, 
y™,-++,y, and is not itself a primary derivative. Let us denote it by y. 
Then throughout (7) we may replace y“ everywhere by its equivalent ex- 
pression, and thus obtain a new system of differential equations, of form (7), 
in which the primary derivatives are the same except that y has taken the 
place of y‘. This new set therefore contains in it a normal set composed of 
the previous normal set y, y, y®, ---, y® and the additional derivative 
y. The new wronskian of the system of differential equations vanishes 
nowhere in the p-dimensional region A,. But although the coefficients in 
system (7) are supposed to have all of their first derivatives, it is logically 
possible that all of the first derivatives do not exist for the coefficients of the 
new system. We shall suppose that these derivatives do exist.* Then on 
the new system of differential equations we may proceed as before, enlarging 
step by step the number of primary derivatives which form a normal set until 
finally a system is obtained for which all of the n primary derivatives form a 
normal set. In this reduction we have supposed of course that certain deriva- 
tives exist for the coefficients in system (7). In any event, it is clear that 
such derivatives need not exist of order higher than mn — r — 1, where r is 
the number which appears in the above discussion. 
We shall henceforth suppose that the primary derivatives in system (7) 


* The writer does not know whether they exist as a consequence of the previous hypotheses 
on system (7) or not. Since the form to which we shal! ultimately reduce our system seems 
essential for the proof of the existence theorem, it would be of interest to have this point 
settled. 
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form a normal set, and shall then speak of system (7) as being in the normal 
form. The chief importance of the normal form lies in the fact that system 
(7) in that form may be regarded as a system of np partial differential equa- 
tions of the first order in the p independent variables u;, wz, «++, up, and the 
n dependent variables y, y, y®, ---, y“, since among equations (7) 
occur a certain number which define each primary derivative as the first 
derivative of another primary derivative belonging to the same chain.* 


6. THE CANONICAL FORM OF A COMPLETELY INTEGRABLE SYSTEM 


It is well known that by a transformation of the dependent variable of the 
form y = \(x)¥% any ordinary homogeneous linear differential equation of 


the nth order, 
d"y d™1y 


whose coefficients are continuous functions of x, may be transformed into one 
of the same form in which the coefficient p; is zero. We shall now show that 
a similar thing may be done for certain completely integrable systems. 

We introduce a new dependent variable, 7, connected with the old by the 
equation 

where J is a function of u;, W2,-+++, Up. Any derivative of y, say y , is con- 
nected with derivatives of the new dependent variable, 7, by an equation of 
the form 
y = hy fees, 

where the part omitted in the right-hand member is linear in derivatives of 7, 
and contains all those derivatives of 7 from which 7” may be obtained by 
differentiation. 
_ Let us now carry out this transformation on a particular kind of completely 
integrable system, namely, one in which the primary derivatives not only 
form a normal set, but have the additional property that, if a derivative y 
belong to the set, then every derivative of lower order from which y is ob- 
tainable by successive differentiation is likewise a primary derivative. It is 
not difficult to see that in this case, if y be replaced everywhere by Ay, the 
system of differential equations is changed into a new system, with the de- 
pendent variable 7, which new system may be thrown into exactly the same 
form as the original system. Let us write the new system 
oy 


=(j, k) =(i) 
aj 
i=0 


(14) 


* See footnote on page 503. 


(j =0,1,---,n—1;k =1,2, 


506 G. M. GREEN: [October 


where the coefficients a” are functions of the old coefficients a?” (of 
system (7)) and also of \ and its successive derivatives. Of course the actual 
expressions depend upon the form of each individual primary derivative, 
which form we have not specified. 

We may, however, obtain explicit results with regard to the transformation 
of the wronskian and its first derivatives. Let us denote by W the wronskian 
for the transformed system (14). We recall that the set of primary derivatives 
has the property that if y‘” belong to the set all the derivatives of y from 
which y may be obtained by differentiation also belong to the set. Conse- 
quently the wronskian W for the new system is easily seen to be related to the 
wronskian for the old system by the equation 


(15) W=rW. 
Differentiating this equation with respect to u;, we have 


aw aw 
Ou, Oux + Ou,’ 


or, using equations (13) and (11), 


Uk Ou,’ 


where f denotes the function which takes the place of f for the new system of 


differential equations (14). Substituting for W its value as obtained from 
(15), and recalling that W is nowhere zero in the region A, we find that 


of dn 
Ou, OUp Our 


This result enables us to state the theorem: 

THEOREM IX. Suppgse a system of partial differential equations of the 
form (7) has the following properties: 

1°. It is completely integrable, with n fundamental solutions whose wronskian 
(3) vanishes at no point of A. 

2°. The set of primary derivatives is such that, if y® be any one of the set, 
then all the derivatives of lower order from which y may be obtained by differ- 
entiation also belong to the set. 

3°. All of the primary derivatives exist for each of the np coefficients a>” 
(7 =1,2,---,p). 

Then the system of differential equations (7) may be transformed into another 
uniquely determined completely integrable system (14) for which all of the quantities 


(k=1,2,-++,p), 
j=0 
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are zero, by the transformation of the dependent variable y = dy, where 
(16) 0, = const. e//", 


The system (14) thus obtained will be unique, and its coefficients a‘/'” 
will be explicit expressions in the coefficients of system (7) and their deriva- 
tives. Let us call this uniquely determined system the canonical form of 
system (7). 

It should be noted that part 3° of the hypothesis is necessary in order that 
dy may be substituted for y throughout equation (7). 

Let us return to the general transformation y = \j, which converts system 
(7) into system (14); we now suppose that system (14) is not in the canonical 
form. The coefficients of system (14) are expressible explicitly in terms of 
the coefficients of (7) and their derivatives, and \ and its derivatives. Con- 
sider any function F of the coefficients of (14) and their derivatives. Let F 
be the same function of the corresponding coefficients of (7) and their deriva- 
tives. The function F is equal identically to a function of \ and the coefficients 
of (7), and the derivatives of these quantities. If in particular this latter 
function is related to F by the identity 


where r may or may not be zero, we call the function F a seminvariant of 
system (7). 

The following theorem is a consequence of the fact that system (7) and the 
transformed system (14) have exactly the same canonical form: 

TuHeoreM X. The coefficients of the canonical form are seminvariants of 
system (7). All other seminvariants are functions of these and of their derivatives, 
af such derivatives exist. 

The reduction of a completely integrable system to the canonical form is 
seen by the above discussion to be a purely mechanical process, performable 
in any case coming under the hypotheses of Theorem IX. This fact is of 
practical importance in projective differential geometry, if the general method 
of Wilczynski be followed. We shall return to this question in the next sec- 
tion, applying the method to a particular geometric problem. 

In § 8, we shall give an existence theorem for systems of form (7) in which 
the primary derivatives constitute a normal set. It should be noted that 
in the hypothesis 2° of Theorem IX, however, the set of primary derivatives 
is still further restricted; as a matter of fact, we may show by an example 
that this restriction is in general necessary, so that not every completely 
integrable system may be reduced to a unique canonical form. Denoting 
by subscripts partial differentiation with respect to the two independent 


~ 
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variables u and v, let the primary derivatives be y, yu, Yu. These form a 
normal set. If y:, y2, ys be functions of uw, v for which in the region A the 


wronskian 
W =(|y, Yus Yuo| 


is nowhere zero, we may set up the completely integrable system having 
Yi, Y2, Ys as a fundamental set of solutions: 


Yo = ay + byu + cyw, 
Yuu =@ y yuo, 
Yuu = tb’ yn +c yuo, 
Yu = +O Yur, 


where in general c + 0. Making the transformation y = \gj, we find that 
W is transformed into 


W + Au Ts + Fu + Au Jo + Aw J] = (M+ 


in which use has been made of the first of the system of differential equations. 
Now, if ¢ + 0—a condition which may be realized at pleasure—this relation 
between W and W is not of the form (15), so that the differential equation (16) 
which determines is replaced by one of the second order. Thereiore, 
although the system may be reduced to the canonical form, this form will not 
be unique. 

Of course, instead of making the said hypothesis 2° of Theorem IX, we 
might demand merely that the wronskian and its transform be related by 
equation (15). Our example would then be included in this case, if ¢ were 
identically zero in A. 


7. GEOMETRIC APPLICATION; CURVILINEAR COORDINATES* 


Completely integrable systems of the kind we have been considering have 
attained increased importance of late owing to their systematic use in certain 
fields of projective differential geometry. Since any solution of the system is 
linearly dependent on any fundamental set of solutions, it follows that a geo- 
metric configuration defined by a fundamental set of solutions is a projective 
transformation of the configuration defined by any other fundamental set of 
solutions. Consequently, any geometric property expressed in terms of the 
completely integrable system, being common to all the projective trans- 
formations of the configuration defined by a fundamental set of solutions of 
the system, is a projective property. Wilczynski has developed a general 
method for dealing with certain questions from this point of view, and it 


* This section is independent of § 8. 
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is in connection with this method that the results of § 6 find an important 
application. 

Let us consider in particular the general subject of curvilinear coérdinates 
in space of n dimensions.* If y2, be interpreted as homogeneous 
coérdinates of a point in this space, the equations 


(17) Yi = Yi( Ur, Us, Un) (¢=1,2,+--,n+1) 


will define a system of curvilinear coérdinates, provided the ratios of the y’s 
do not all reduce to functions of fewer than n of the independent variables 
U1, U2, ***, Un. We shall suppose that all derivatives of any order exist for 
the functions y;. Since at least one of these functions, say yn41, is different 
from zero at some point, it will be different from zero in an n-dimensional 
region about this point. We wish, then, the analytic condition that the n 
ratios i = yi/Ynti, (1 =1,2,---,m) actually define ©” points. This 
condition is the non-vanishing of the jacobian of the functions 7;, and this 
may be reduced to the non-vanishing of the determinant 


Yrs 


where y/\”, x”, etc., denote dy:/du;, Ay2/du;, etc. We may therefore set 
up a completely integrable system of form (7) of which the n + 1 functions y; 
constitute a fundamental set of solutions, and in which the primary deriva- 
tives arey, y, y™. This system is 


dy? 
Vk i=0 


(19) 


Let us differentiate W’ with respect to u,; then on using equations (19) we 
find without difficulty that 


ow — 
+ af? + --- W. 
We may therefore put 
of 
(HB) k =1,2,+++,n). 
La 


* The case n = 2, i. e., nets of plane curves, has been studied by Wilczynski, One-parameter 
families and nets of plane curves, these Transactions, vol. 12 (1911), pp. 473-510. 
The present writer considered the case n = 3 in his Columbia dissertation, Projective differ- 
ential geometry of triple systems of surfaces, Lancaster, Pa., Press of The New Era Printing 
Company, 1913. 


| 
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Now, there is an arbitrary feature introduced in equations (17) because of 
the homogeneity of the coérdinates. If \ be any function of wu, we, +--+, Un, 
then the points defined by the coérdinates 7; (¢ = 1, 2, ---,m +1) will be 
the same as those defined by equations (17), if y; = \y;. Consequently, if a 
projective property of the configuration is to be expressed in terms of the 
coefficients of the completely integrable system (19), it must be in terms of 
what we called in § 6 the seminvariants of the system, viz., those combinations 
of the coefficients of equations (19) and their derivatives which remain essen- 
tially unchanged when the system is subjected to an arbitrary transformation 
of the form 

y = dry. 

Let us carry out this transformation of the dependent variable, thus con- 
verting system (19) into one of the same form, 

(20) = =a (j,k =1,2, -++,n). 
120 

The coefficients of this system are functions of the coefficients of (19) and 

their derivatives, and also of \ and its derivatives. In particular, 


(j,k =1,2,--+,m), 
so that, on denoting by df/du, the quantity >>7_, a”, we find directly that 


On 
dup dup (n+1)50 (kK =1,2,---,m). 


This also follows from the general results of § 6. Consequently, the quantities 
f® = df/du, will all be zero if \ be chosen as a solution of 
— (n+1)A =0 (k=1,2,---,mn), 
which gives 
= const. e//(**D , 
With this choice of \ the coefficients of the new system (20) have the values 
1 


(21) =(j,k) as k) 1 (x (3, of? (3) f(k) 
a) = | > as f 


- k 
ay” 


= ah” (i+j,k), as = a k) —f®, 


a= ah” — t,j7,k =1,2,---,n), 
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where throughout single superscripts denote differentiation. The quantities 
(21) are the fundamental seminvariants of system (19), and any seminvariant 
of the system is a function of these and their derivatives. 

There still remains, however, an arbitrary feature in equations (17). This 
is in the choice of parameters. Since the system of curvilinear codrdinates 
remains unchanged under the transformation 


(22) = U; (ur), (kK =1,2,-:-,n), 


it follows that projective properties of the configuration are expressible in 
terms of those seminvariants which remain essentially invariant when system 
(19) is subjected to the transformation (22). It is possible to calculate 
explicitly, and without any difficulty, these invariants, but our object here is 
not to give a systematic theory of curvilinear coérdinates. 

The fact that even in this very general problem the invariants may be 
calculated explicitly, makes it highly probable that the analytic methods of 
Wilczynski are practicable in handling problems in the projective differential 
geometry, not only of configurations in spaces of a few dimensions, but also 
in space of n dimensions. The only methods which have hitherto been used 
systematically in studying configurations in hyperspace are those of Segre.* 
The ideal method seems to the present writer to be one in which all the pro- 
jective differential properties are expressible by means of a unique analytic 
apparatus, and this is to be found in the completely integrable system of 
which a fundamental system of solutions defines the particular geometric 
configuration under discussion. 


8. THE EXISTENCE THEOREM FOR COMPLETELY INTEGRABLE SYSTEMS. 
THE LINEAR DEPENDENCE OF SOLUTIONS 


The theorems of previous sections concerning the system of partial differ- 
ential equations (7) presuppose the existence of a fundamental system of n 
solutions. The question of this existence presents no essentially new diffi- 
culty, for the remark made at the end of § 5 renders immediately applicable 
to system (7), which is supposed to be in the normal form, a well-known exist- 
ence theorem concerning systems of total differential equations. 


* For references to some of this work, see E. Bompiani, Recenti progressi nella geometria 
proiettiva differenziale degli iperspazi, Proceedings of the fifth international congress of mathe- 
maticians, Cambridge, 1912, vol. II, pp. 22-27. 

In this connection should be mentioned also Prof. Wilezynski’s Cleveland address, Some 
general aspects of modern geometry, Bulletin of the American Mathemati- 
cal Society, vol. 19 (1913-14), pp. 331-342. 

t Cf., for example, Ch.-J. de la Vallée Poussin, Cours d’Analyse Infinitésimale, 2me éd., 
vol, 2, chap. VII, §6. The theorem is essentially that of Mayer, given in the memoir cited 
in §3. 
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A simultaneous system of n total differential equations in n dependent 
variables 2, (h = 1,2, ---,m) and m independent variables y; (1 = 1, 2, 
-, m) is of the form 


(23) da, = af? dy; 
(h =1,2,-+++,n; i =1,2,-++,m), 


the a’s being continuous functions of the 2’s and y’s in a region A. Let us 
write 
Then we shall say that system (23) is completely integrable if its coefficients 
satisfy the identities 


(i) (ke 
(24) ba; ba,” 


We shall now state the existence theorem for this system. 

THEOREM. Suppose that the identities (24) subsist for system (23) throughout 
the region A, and let y¥, ---, y; 2, ---, a& be a set of values of the inde- 
pendent and dependent variables in the interior of this region. Then there exists 
one and only one set of n functions x,, «++, 2, of the m independent variables 
Yi» ***» Ym, Which satisfy the system of differential equations and which respec- 
tively take on the values x”, x when = Ym = 

Consider now system (7), wg 


— 
= > a ) y 


i=0 
(j = 0,1, 1; k =1, 2, 


in which we suppose that there are exactly n primary derivatives, and that 
they form a normal set. Let us regard y, y™, ---, y-” as dependent 
variables; then the system becomes a very particular case of the system (23) 
which we have just been considering. The integrability conditions (24) take 
precisely the form of equations (9). Moreover, from the remark made at 
the end of §5, system (7) when regarded as a system of partial differential 
equations of the first order with the dependent variables y, y™, ---, y*~» is 
exactly equivalent to system (7) with its original significance, viz., that in 
which y, y, ---, y are primary derivatives of the function y, each 
superscript denoting a certain particular, though unspecified, partial deriva- 
tive of y. The exact equivalence of the two points of view follows from the 
fact that the primary derivatives form a normal set. In other words, if 
system (7) be regarded as a system of partial differential equations of the 


(4,4 =1,2,-++,m;h =1,2,---,n). 
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first order in n dependent variables, then a set of solutions y, y, «++, y*» 
will have the property that y® , y®, ---, y“~» are exactly those derivatives 
of y which the original superscript notation signified; and all this without 
requiring the addition of new equations to system (7). 

The force of these remarks, especially the last, may be seen by means of 
the completely integrable system in two independent variables u and v, with 
primary derivatives y, Yu, Yv: 


Yuu = Ayu + by, + cy, 
(a) Yuu = tbyte'y, 
You =a" 

where the subscripts denote differentiation. This system may be replaced 
by a system with three dependent variables by means of the substitution 
(b) y=y, 
and will then have the form 

y = ay + by® + cy, 

=a’ y® +B y® 

y? = al y® + +e y. 


This is not in the form (7), however, because all of the first derivatives of 
all of the primary derivatives do not appear in the left-hand members of 
system (a’). But equations (a’) and (b) together constitute a system of 
form (7). System (a’) alone is not equivalent to system (a), but systems (a’) 
and (b’) together are equivalent to system (a). 

A better illustration is afforded by the system of form (7) introduced at 
the beginning of § 5, viz., one in which the primary derivatives are y and yuu, 
and which we there saw had no solution different from zero. If we put 


(a’) 


Yuu = y, 
then the system in form (7) is 
Yu = ay + by®, y=aytd'y®, 
ay tb’ y®, 


(1) 


Yu 


These four equations, regarded as equations of the first order in the dependent 
variables y, y™, will have sets of solutions y, y™ different from zero if the 
integrability conditions (9) are satisfied for the system. But the function y™ 
will not in general be the proper derivative of the function y, viz., yuu. This 


| 
i 
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can not happen in the case of the other example, because equations (b), which 
form part of the system in the form (7), make it certain that a solution y, y™, 
y™ of the system consisting of equations (a’) and (b) will have the property 
that y™ and y® will be the proper derivatives, y, and y, respectively, of y. 

Let us, then, regard system (7) as a system of partial differential equations 
of the first order in the n dependent variables y, y, ---, y". Applying 
the existence theorem for system (23), of which (7) is a particular case, we 
may state the following existence theorem: 

TuHeorEeM XI. Suppose that in the system of partial differential equations 


(3, ®) 
(7) 
(j =0,1,---,n—1;k =1,2, -+-,p) 
the primary derivatives y, y , ---, y form a normal set. Suppose further 


that in the region A the coefficients as”, which are functions of the p inde- 
pendent variables u,, U2, +++, Up, satisfy identically the integrability conditions 


da k) n—1 n—1 
v 
al’: k) a’ | > al’ a k) 
Ou; 1=0 i=0 


(v¥,j7 =0,1, — k,l=1,2, Dp). 


(9) 


(0) 


Let (ui, uy?, «++, us?) be any point of A, and yo, yi’, ---, ys? be any 
set of n constants. Then there exists one and only one function y of the variables 
Uy, U2, ***, Up which satisfies the system of differential equations, and whose 
primary derivatives y, y, ---, y"™ take on respectively the preassigned con- 
stant values yo, y, «++, at the point (u\”, uf, 

As in the familiar case of ordinary differential equations, this theorem 
allows us to infer the existence of nm linearly independent solutions whose 
wronskian vanishes nowhere in A. In fact, we may construct a particular 
set of solutions having this property, by choosing solutions Y;, Y2, ---, Yn 
such that the primary derivatives of the solution Y; take on the values 


YP=O0(j#t-1), YY =1 (7 =i-1) 


(0) 


at the point (u{”, uf", ---, uw”). At this point the wronskian of Yi, Ye, 


-, Y, is then 


Be 0, 

0, 

= 1, 
sa, 


so that the wronskian vanishes nowhere in A, by Theorem VIII. Moreover, 
that solution of the system for which the primary derivatives take on the 
at the point Up) is given by 


values yo, y, 


1916] LINEAR DEPENDENCE OF FUNCTIONS 515 


y=yVity +y 


If a solution has all of its primary derivatives zero at any point of A, it 
therefore vanishes identically in A. Let y1, y2, «++, Yn be any n solutions 
of the completely integrable system; then no function, other than zero, of 
the form 


can vanish together with its derivatives y, y®, ---, y“ at any point 
of A, unless the constants c; are all zero. This set of functions y1, y2, +++» Yn 
has therefore the properties of the n functions of Theorem III. We may 
therefore apply this theorem to prove the following one, which is also a conse- 
quence of the existence of n solutions whose wronskian vanishes nowhere 
in A: 

TueoreM XII. If y:, yo, +++, Yn be any n solutions of the completely 
integrable system of Theorem XI, then the vanishing, anywhere in A, of the 
wronskian of these n functions is a sufficient condition for their linear dependence. 

This theorem is included in the following more general one. They are 
both generalizations of theorems given by Bécher for ordinary differential 
equations.* 

TueoreM XIII. Jf y1, yo, yx (kK Sm) be solutions of the completely 
integrable system of Theorem XI, a sufficient condition that they be linearly 
dependent in A is the identical vanishing in A of all those k-rowed determinants 
of the matrix 

M = My-i(yi, yo, Ye) 


which contain elements of the first row of the matrix. 

The proof of this theorem follows immediately from Theorem IV. The 
matrix M of Theorem IV is in the present case the matrix formed from the 
primary derivatives of the functions y1, y2, --:, yx, while the augmented 
matrix M’ has additional rows consisting of the first derivatives of the quanti- 
ties appearing in M. But these first derivatives are linear combinations of 
the primary derivatives, in virtue of the completely integrable system, so 
that if any v columns (v S £) be selected from M, the vanishing of all the v- 
rowed determinants formed from these columns and containing elements 
of the first row of M carries with it the vanishing of all the v-rowed deter- 
minants formed from the same v columns of M’ and containing elements of 
the first row of M. Moreover, no solution of the completely integrable sys- 
tem, other than zero, can have all of its primary derivatives vanish simul- 
taneously at any point of A. Consequently all of the hypotheses of Theorem 
IV are satisfied by our functions y1, y2, «++, yz, and the present theorem 
follows at once. 


* M. Bocher, loc. cit., these Transactions, vol. 2 (1901), pp. 139-149, theorem VII. 
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Enough has been given to show the marked analogy between ordinary 
homogeneous linear differential equations of the nth order and the completely 
integrable systems which have come under discussion in this paper. This 
analogy indicates the direction in which facts concerning ordinary differential 
equations may be extended to systems of partial differential equations. It is 
obvious how an adjoint system may be defined; the question naturally presents 
itself as to whether boundary value problems similar to those for ordinary 
differential equations may arise in connection with completely integrable 
systems. Questions concerning analytic functions of several variables suggest 
themselves also. In particular, can the solutions of a given completely 
integrable system be characterized by their properties as analytic functions, 
that is, does there exist a solution of the Riemann problem for such a system? 
Unfortunately, the theory of functions of several variables has not yet reached 
a degree of development comparable with that of functions of one variable, 
so that it is doubtful whether the generalization is feasible at present. How- 
ever, all of these subjects certainly deserve investigation. 


Harvarp UNIVERSITY, 
November, 1915 
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ON THE FACTORIZATION OF EXPRESSIONS OF VARIOUS TYPES* 


BY 


HENRY BLUMBERG 


INTRODUCTION 


Not a few mathematicians have dealt with the problem of setting up criteria 
by means of which the irreducibility of certain expressions in certain domains 
may be seen at a glance from the character of the expressions. Gauss, Kro- 
necker, Schoenemann, Eisenstein, Dedekind, Floquet, Koenigsberger, Netto, 
Perron, M. Bauer, and Dumas have written on the subject.t A simple 
example of such criteria is the well-known Schoenemann-Eisenstein theorem, 
which is essentially as follows:{ If the polynomial 


C=y* + ay" 
with integral coefficients, is such that a fixed prime p goes into every 
c,{v = 1,2, nj, 


but p® does not go into c,, then C is irreducible in the domain of rational 
numbers. This theorem, while comparatively not rich in content, may be 
regarded as the nucleus of the work of the authors mentioned. 

With the exception of Floquet and Koenigsberger, the authors mentioned 
deal exclusively with the case where the expressions are polynomials and 
chiefly with the case of polynomials whose coefficients belong to the domain 


* Presented to the Society in less general form, November 28 and December 29, 1914. 

¢ Gauss, Disquisitiones arithmeticae (1801), Art. 341; Kronecker, Journal fiir Math- 
ematik, vol. 29 (1845), p. 280, vol. 100 (1887), p. 79, Journal de Mathéma- 
tiques, vol. 19 (1854), p. 177, ser. 2, vol. 1 (1856), p. 399; Schoenemann, Journal fiir 
Mathematik, vol. 32 (1846), p. 100, vol. 40 (1850), p. 188; Eisenstein, Journal 
fir Mathematik, vol. 39 (1850), p. 160; Dedekind, Journal fiir Mathe- 
matik, vol. 54 (1857), p. 27; Floquet, Annales de l’école normale (1879) 
[cited by Koenigsberger]; Koenigsberger, Journal fir Mathematik, vol. 115 
(1895), p. 53, vol. 121 (1900), p. 320, Mathematische Annalen, vol. 53 (1900), 
p. 49; Netto). Mathematische Annalen, vol. 48 (1897), p. 81 and Algebra (1896), 
vol. 1, lecture 5; Perron, Mathematische Annalen, vol. 60 (1905), p. 448, Jour- 
nalfiir Mathematik, vol. 132 (1907), p. 288; M. Bauer, Journal fir Mathe- 
matik, vol. 128 (1905), p. 87 and p. 298, vol. 132 (1907), p. 21, vol. 134 (1908), p. 15; 
Dumas, Journal de Mathématiques, ser. 6, vol. 2 (1906), p. 191. 

t Schoenemann and Eisenstein, 1. c. This theorem has been wrongly attributed to Eisen- 
stein alone by most of the mathematicians who have had occasion to refer to it. 
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of rational numbers. Floquet and Koenigsberger have extended the investi- 
gation to linear homogeneous differential expressions. 

A variety of methods have been employed. Thus the theory of algebraic 
fields, the theory of algebraic functions and the character of the solutions of 
linear homogeneous differential equations have been used.* Elementary 
methods, requiring no such means and based on the study of the expression 
as such without regard to the character of the solutions of the equation obtained 
by setting the expression equal to zero, have mostly succeeded in yielding 
only the less general results. f 

The principal aim of the present paper is to communicate a general theorem 
and certain immediate consequences of it, which include as special cases, 
with minor exceptions,{ all the results heretofore obtained in the indicated 
field of investigation. This theorem is given in § 6, and for convenience of 
designation referred to as the “ Theorem on the Product ”’; its direct conse- 
quences are the “ Theorem on Common Divisors” ($6), the ‘“ Factoriza- 
tion Theorem” and the “ Criterion for Irreducibility”’ (§7). Thus the 
work on the subject before the publication of the Schoenemann-Eisenstein 
theorem is summarized and generalized by that theorem, which is an exceed- 
ingly special case of the “ Criterion for Irreducibility.”” Moreover, the 
Floquet theorem, all the results in the first and the third papers of Koenigs- 
berger, the chief results in his second paper, all the results of Netto, all the 
results in the 1905 paper of Perron, the chief results in his 1907 paper and 
the chief results of Bauer and of Dumas come under this “ Theorem on the 
Product.’’§ 

A former note|| was intended to give a summary of the present paper. 
Since then, however, the author has found how to unify and to generalize 
his previous results by means of the present theory. The note materially 
extends most of the known results in the field under discussion. For further 
details, the reader is referred to the note itself. Its relation to the present 
paper may be approximately described as follows: While the “‘ Theorem on 
the Product ” asserts that every pair (m,,t;), {f =1,2,---, h}, of the 
characteristic set S¢ satisfies the condition imposed upon it by the relation 
S, + Sz = Sc, the note makes use merely of the fact that the first pair 
satisfies this condition.{ Attention is also directed to the replacing of the 
three theories of the note by the single theory T, of the present paper. 


* Heretofore the results have not been carried over into the field of difference equations, 
possibly because of the recency of the publication of existence theorems in that field. 

t Dumas’s memoir, |. c., is a notable exception. 

t For example, theorem I of Perron’s 1907 paper, I. ¢., p. 291. 

§ See previous citations. 

||Proceedings of the National Academy of Sciences, June, 1915, 
pp. 374-381. 

] See sections 4, 5, and 6. 
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The large variety of cases covered necessitates an abstract, postuiational 
treatment. The different special theories (for polynomials with various 
kinds of coefficients and for difference and differential expressions with various 
kinds of coefficients) are thus made to appear as instances of an abstract 
general theory T, (see §§3 and 8). Only in such fashion can the inter- 
connection of the more special results be brought into clear light and their 
unification effected. This abstract treatment is, moreover, especially fitting 
here, because a small number of simple assumptions is sufficient for the founda- 
tions of the theory. 

The “ Theorem on the Product” applies to Situations 4, 5 (for k>1), 
6, 7, 8, 9 ($3), not treated before, as far as the author knows, and enriches 
in a marked degree the results for Situations 3 ($3) and 10 (§8) already 
studied. Nevertheless, it does not go beyond the results of Dumas for the 
case of polynomials whose coefficients are rational numbers or Hensel p-adic 
numbers. Our general theory may, in fact, be regarded as an extension to 
an abstract situation of Dumas’s theorem, I. c., p. 217.* 

The remarks at the close of §7 indicate the great difficulty in obtain- 
ing a more exhaustive criterion for irreducibility than the one actually 
obtained. 

A sharp distinction has been made in the literature (as regards polynomials) 
between those investigations that are based on the divisibility properties of 
the coefficients and those that proceed from the consideration of the magni- 
tude of the coefficients.t One of the interesting results is that the gap between 
these two types of investigation is bridged, both appearing subordinate to our 
general theory. 

It is easy to see, as Koenigsberger has pointed out by an example, that the 
Schoenemann-Eisenstein theorem cannot be directly extended to the case of 
linear homogeneous differential expressions. Our abstract treatment further- 
more lays bare the underlying reason—by no means evident otherwise— 
why it breaks down, and at once indicates what analogous theorem may 
replace it. The “ Criterion for Irreducibility ” may, in fact, be properly 
regarded as a highly generalized Schoenemann-Eisenstein theorem for the 


*It is interesting in this connection to compare several remarks in Perron’s 1907 paper, 
l. c. Referring to the work of Dumas on polynomials with rational coefficients as related 
to that of his predecessors, Perron says (p. 288): ‘‘ Zu etwas allgemeineren Resultaten ist 
neuerdings Herr G. Dumas gelangt.’’ Later on in the same paper (p. 304), Perron designates 
his most general result, Theorem VIII, as “ein sehr allgemeines Kriterium.” The italics 
are in both cases my own. It turns out, however, that Perron’s criterion is a special case of 
our theory, which is a natural and easy generalization of Dumas’s results. 

+ Cf. Perron, 1907, 1. c., p. 288 and Loewy in Pascal’s Repertorium (1910), Analysis II, 
pp. 292 and 293. The same kind of distinction may, of course, be made with regard to differ- 
ential and other expressions. 
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case of linear homogeneous differential expressions with coefficients taken as 
in Situation 6, 7, or 10.* 

Here is a brief summary of the contents: Section 1 shows by means of an 
illustrative example how the Theory T_, of Section 2 arises. Section 3 de- 
scribes various concrete situations falling under the Theory T_1. Section 
4 defines “ characteristic set’’—a fundamental notion for our purposes—and 
Section 5, the “sum of two characteristic sets.” Section 6 gives the proof 
of the “Theorem on the Product” by the aid of two lemmas (established in 
§§4 and 5). In Section 7, the “ Factorization Theorem ” and the “ Criterion 
for Irreducibility ” are obtained. Section 8 introduces the abstract theory 
Z, for every real ¢« and fore = — ©. The appendix gives concrete examples 
of our results, at the same time exhibiting many instances of contact with 
the literature. 

To secure greater clearness, nearly all the letters used are made to have a 
fixed meaning throughout the paper. Thus, for example, throughout the 
paper, A, B, C stand respectively for (ao, a1, «++, Gr), (bo, bi, ---, ds), 
(Co, C1, &, By, respectively for the “rank” of a,, b,, ¢, (see 
§ 2); (me, te), (ma, t,)}, for the “ characteristic set” S¢ 
of C (see § 4); ete. Slight departures from this state of affairs will be evident 
from the context. 


1. AN ILLUSTRATIVE EXAMPLE SUGGESTING THE ABSTRACT TREATMENT 


Instead of proceeding directly to the exposition of the abstract theory 
mentioned in the introduction, we’ shall foreshadow the abstract situation 
by drawing certain pertinent conclusions in regard to one of its concrete 
instances.t In this way, we believe, the assumptions used in § 2 will be suf- 
ficiently motivated for the reader. 

Let S denote the class of rational functions of x with complex numerical 
coefficients; and let 


n—1 


d” d 


be a linear homogeneous differential expression whose coefficients belong to S. 
We are concerned with the representation of C as a product A - B, where 


d’ 
A= ao (x) + ay (2) + ba,(x)y 


* Curiously, Koenigsberger himself has obtained results (essentially for Situation 10, § 8) 
that may be properly regarded as extensions of the Schoenemann-Eisenstein theorem to 
differential expressions. Lacking the abstract background, he failed, however, to notice 
this relation. 

+ The instance given has actually been the starting point of the author’s present investi- 
gations. 
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and 
d® 


the coefficients a, and b, belonging to S. As usual in such cases, A - B 
means here the symbolic product of the two differential expressions, so that 


a’ B 


A tage t ta,B 

dts db 

= do bo + ( + a; bo + ao bs ) + 

r+s=n y 
where 


( ) (r—d)! 
y—-rA—p) typ)! 
Understanding, as usual, by the degree of a rational fraction the degree of 
its numerator minus the degree of its denominator, we desire to call attention 
to several evident relations between the degrees of the c’s on the one hand 
and the degrees of the a’s and of the b’s on the other. From the expression 
of c, in terms of a, and b,, we see that c, is a sum of products of the form 
b,/dz? , where X + u +o =v and h,,, is a numerical coefficient. 
We now use the following facts regarding the degrees of products, sums, and 
derivatives of rational fractions: (1) The degree of a product is equal to the 
sum of the degrees of the factors. (2) The degree of a sum is at most equal 
to the highest degree d attained by its terms, and exactly equal to d if only 
one term has the degreed. (3) The degree of df (x)/dz,f (x) being a rational 
fraction, is at most one less than the degree of f(2). To make these laws 
hold without exception, we agree to set the degree of 0 equal to — ~ and 
to calculate with — « in the usual manner. Thus —-» +l1=—o, 
etc. Let a,, B,, y, denote the degrees of a,, 
b,, ¢,. It is then clear that y, is at most equal to the largest value attained 
by the numbers of the set a, + 6, —o, whend, uw, anda vary so as to satisfy 
the relation’ + »+o02= vy. This fact we express conveniently as follows:* 


* The fact that o =r — d really imposes more stringent conditions upon , than we use. 
Because of our weaker conditions, we gain not only generality but also the important advantage 
of symmetry, by virtue of which later proofs are materially shortened. Condition (b) below 
is also weaker than the equivalent of the second part of (2) above (regarding the degree of a 
sum). 
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(a) max (qa +6, —o7),{v=0,1,2, 
Atuto=v 


Furthermore, suppose that for the fixed values \ = X, p=n, c=0 
(where A+R a; +6; is greater than a, + 6, —o for every [A, 
+ [X, 2,0] satisfying the relation \ + » +o = »; it follows that y, = a; 
+ B;. Hence, in addition to (a), we have: 

for every [A, un, = [X, O], = 0,1, 2, n}. 

It will appear in the sequel that the simple and evident properties (a) and 
(b) of linear homogeneous differential expressions with rational coefficients 
essentially suffice for the demonstration of our factorization results concerning 
such expressions. But the properties (a) and (b) are shared by many other 
expressions (see the instances in $3). To dispense with the duplication of 
proofs and for other reasons given in the introduction, we proceed (in the next 
section) to the exposition of an abstract theory T_1, of which the illustrative 
example described in this section is a special instance.* The domain © of 
rational functions will be replaced by an unrestricted aggregate ©; the differ- 
ential expressions, by ‘“ parentheses” of ©; the degree of a rational func- 
tion, by the “‘ rank” of an element of ©; and the product of two differential 
expressions, by the ‘‘ product ” of two “ parentheses,” the abstract “ product ” 
being subjected to postulates corresponding to the properties (a) and (0). 


2. THE ABSTRACT THEORY T4 


Let S denote any given aggregate whatsoever.t We shall deal with finite, 
ordered subaggregates E = (€, €2, €m) S,e, {vy = 0,1,2,---,m} 
being an element of ©. Such a finite, ordered subaggregate E we shall call 
a “ parenthesis of S.”” We assume that the “ product” 


A-B= (ao, a, a2, +++, dr) + (bo, bi, be, -++,b,) 
of any two parentheses A and B of © is equal to a parenthesis 
C= (Co, C1, C2, Cn) 


of S for which n = r +s. We assume furthermore that with every ele- 
ment e of © there is associated a single number 7 called the “rank” of e, 
n being a real number or — © (never + «),f{ such that when 4-B=C 


* The reason for the notation “‘T_,” lies, as indicated in the introduction, in the fact that 
this abstract theory is later (§ 8) exhibited as a special case of a more general theory T., the 
latter reducing to the former ife = — 1. 

t In regard to the use of the unrestricted aggregate (= general range) and several other 
features of this paper, such as the postulational unification of analogous theories, compare E. 
H. Moore, /ntroduction to a Form of General Analysis (1910), preface. 

t In all the concrete situations 1-10 incl., with the exception of 4b, 7b, and 9b, 7 happens 
to be always an integer. The proof of the ‘‘ Theorem on the Product ” is, however, not ren- 
dered more difficult if 7 is merely restricted to being real. 
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the following properties hold for the “ranks” a, 8, , y, of q,6,,¢,, {vy =0, 
1,2, ---,n}:* 
(a) max (aq +8, 58,050}. 


Atuto=v 

for every [A, #[A, 4,0], {OSA S7,05S 458,050}. 

The “order” of the parenthesis E = (éo, «++, @m) is defined as n — k, 
where k is such that yn =— ©, =— ©,°++,m1=— ©,andm ©. 
If every n, {v = 0,1, ---,m}is — o, the order of E is defined to be 0. 

“ Reducibility,” “ irreducibility,” “ factor,” etc., are defined for T_; (and 
later for T,) in expected fashion. Thus C is said to be “ reducible”’ if two 
parentheses A and B of © exist, such thatr 2=1,s = landC=4A-B. 


3. INSTANCES OF THE ABSTRACT THEORY T_; 


We shall now describe various important situations that fall under the 
abstract theory T_,. For this purpose, we shall define in each case the 
aggregate ©, the parentheses of ©, the product of two parentheses and the 
rank of every element of ©. The verification of the properties (a) and (6) 
of § 2 for the instances given below is left to the reader. 

Situation 1. © consists of the set of rational numbers. We understand 
by the parenthesis €1, €2, €m) of S the polynomial 


coy” +ery™ + + em 
in the letter y. The product of two parentheses 
(do, a1, +++, + (bo, bi, ++, be) 


is defined, as usual, to be equal to the parenthesis (co, ¢1, «++, ¢n), where 
n=r+esand 


Co = = dob; + bo, Ca = a, by. 


The rank of an element e = e’/e’’ (where e’ and e” are integers) is defined 
with reference to a fixed prime p. First let e + 0; let e’ be divisible by p® but 
not by p?*; e”, by p®” but not by p***. We then define the rank of e by 
the equation 7 = d’’ — d’. Moreover, we naturally enough define the rank 


of 0 tobe — 
Situation 2. GS consists of the class of Hensel’s p-adic numbers. The 
parentheses of S and the products of parentheses are defined as in 1. The 


* Because of the nature of our assumptions for T_:, which involve essentially only the 
ranks of the elements of ©, it is possible to build up just as general a theory as T_; by dealing 
exclusively with parentheses whose elements are ranks and hence always real numbers or 
— ©. We have here an example of an apparently very general theory (based on an un- 
restricted aggregate and involving few assumptions) that is equivalent to an apparently very 
special one (about parentheses of real numbers). 
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rank of the p-adic number e is the negative of Hensel’s “ order” of e with 
respect to p.* 
Situation 3. © consists of the class of rational fractions 


in k letters, x,, 22, +++, 2, e’ and e” being polynomials with arbitrary com- 
plex numerical coefficients.t| The parentheses and the products of parentheses 
are defined as in 1. The rank of an element ¢( + 0) is defined by the equa- 
tion » = d’ — d”, where d’ and d” represent respectively the degrees (in the 
usual sense) of e’ and e” in the & letters 2,, 22, ---,2z,. The rank of 0 is 
defined to be — «. 

Situation 4a. consists of the collectivity of elements e = v, 
where the v’s are arbitrary complex numbers, 7 is an integer and also v + 0 
unless ¢ = 0; that is, e is a Laurent series having only a finite number of 
terms with positive exponents.t The parentheses and the products are 
defined as in 1. The rank of e( + 0) is defined to be the exponent 7 of z in 
the first non-zero term of the development of e. The rank of 0is — ~. 

Situation 4b. S consists of the collectivity of series >=? v, 2” where 
6 and 6 (> 1) are integers; the elements of © are thus descending series in 
integral powers of z"*. Otherwise, as in 4a. 

Situation 5. ©S consists of the collectivity of fractions e = e’/e’’, where 
e’ and e” are power series in & letters 2, 22, «++, 2, with arbitrary complex 
numerical coefficients: 


Al, Ag=O 


The parentheses and products are defined as in 1. The rank of e (+0) is 
defined as d’’ — d’, where d’ and d” represent the lowest degrees (in the usual 
sense) in the & letters 2,, --- , 2% of a non-zero term of e’ and e”’ respectively. 
The rank of 0is — 

Situation 6. consists of all rational fractions e(x) = e’(x)/e’’ (zx), 
e’(x) and e’’(x) being polynomials with arbitrary complex numerical coef- 


*See Hensel, Journal fiir Mathematik, vol. 127 (1904), pp. 51-84, or Zahlen- 
theorie (1913), chapters 3 and 6. 

t More generally, the numerical coefficients may belong to any abstract system (K,+,X), 
where K is a class such that if a and b are elements of K both a + b and a X b are elements 
of K. This remark applies just as well to other situations below. 

t The question of convergence does not enter here because the formal character of the 
series is sufficient for our purposes. More generally, we may have similar series in two or 
more letters, the requisite change in the definition of rank being evident. 

§ Situation 6 is the illustrative example of § 1. 
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ficients. The parenthesis (é, ¢1, -++, m) is the linear homogeneous differ- 
ential expression 


d™ 


The rank of e is the degree of ¢ (in the usual sense), and the product of two 
parentheses is the symbolic product of the two corresponding differential 
expressions. 

Situation 7a. © and rank are defined as in 4a; parentheses and products, 
as in 6. 

Situation 7b. © and rank are defined as in 4b; parentheses and products, 
as in 6. 

Situation 8. © and rank are defined as in 6. The parenthesis 


(€0, €1, &m) 
of © is the linear homogeneous difference expression 
€0 Yrim + €1 + + €m Yz- 


The product A - B of two parentheses is the ordinary symbolic product of the 
corresponding difference expressions: 


A-B= (do, (bo, bi, = (Co, C1, 


where n = r+ sand 


= +4 — 9), {y=0,1,2,---,n}. 
p=0 


Situation 9a. © and rank are defined as in 4a; parentheses and products, 
as in 8. 

Situation 9b. © and rank are defined as in 4b; parentheses and products, 
as in 8. 


4.* THE CHARACTERISTIC SET Sg BELONGING TO C 


From now on (till § 8), our discussion is based on the assumptions for the 
abstract theory T_; of §2. In the present section, however (and also in §5), 
we make use merely of the fact that every element of S has a rank (which is a real 
number or — « ).f 
Let C = (¢9, ¢1, --+,¢n) be a given parenthesis of S, y, being the rank of 
ce, {vy =0,1,--+,n}. We assume throughout in what follows that yo is finite 


* At this point, the reader may pass directly to § 8 for the definition of T.; he will thereby 
be enabled (on second reading, after §§ 4-7 incl.) to verify more easily the statements of § 8. 

t On the other hand, if we have no regard to (a) and (b) of § 2, the definition of “‘charac- 
teristic set”’ here given appears highly artificial. 
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(i. e., + — ©). Let, moreover, n 21 (a condition to be subsequently 
dropped). With every such parenthesis C we shall associate in unique fashion 
an ordered set S- of number pairs: 


Sc= {(my, ti), (m2, te), (mp, th)}. 


This set, which we call the “ characteristic set belonging to C,” we now 
define.* _ For short, we designate t, by & {f =1,2,---,h}; =0. 
We set m, equal to max, 50 (7, — Yo/v) except when this maximum is < — 1; 
in the latter case, we set my = —1. If m, > — 1, we set t; equal to the 
last vy (vy = 1,2,---+,m) for which (y, — yo)/v = m; and if m, = —1, 
we set = In general, we define and 8;4;-(and hence ¢,,:) for 
¢=0,1,---,h2 — 1 as follows: 


: Ys 
= — 1 }. 
v>se 


last v > 8, such int = Mer, Uf M1 > — 1; and 


So41 =n, af Me+1 


The following relations are direct consequences of the definition of S¢: 


A 
=DLt=n; t21 {vy =1,2,-++,h}. 


v=1 
(b) > m>--- 


Te" Yv— Ys 


(d) Yor — = Me of > — 1; hence 
¢ 
dm, t, ifm >—1. 


As in the theory of well-ordered sets, we call the set 


* This ‘‘characteristic set” is the natural generalization of the “polygon of Newton” or of 
the “Puiseux numbers”’ for our abstract situation T_1. The “generalized polygon of Newton’”’ 
(for T_1) may be essentially obtained by starting at the origin of a rectangular system of 
axes, and drawing a broken line such that the slope of the ¢th segment is m and its projection 
on the X-axis, ¢-. This broken line is, according to 1b below, concave downward. The 
author might have made the entire treatment (including the proof of the “‘ Theorem on the 
Product ”) depend on the properties of this broken line; he has chosen, however, the equivalent 
analytical procedure. Cf. Hensel and Landsberg, Theorie der Algebraischen Funktionen einer 
Variabeln (1902), vierte Vorles., pp. 39-47 and Bauer, Journal fir Mathematik, 
vol. 132, pp. 21-32. 

t In the theory T_,, the number — 1 plays an exceptional réle; in the theory Te (see § 8) 
the number e plays the corresponding réle. 


— 
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Soy = {(mr, tr), (mg, 


for = 1,2,---,ha “ segment” of Sc. The set Seo, is evidently the char- 
acteristic set of the parenthesis (co, ¢1, +--+, Cs a) of ©. It is convenient in 
the sequel to adjoin the null-segment 0 = Sg, of Sc and regard it as the charac- 
teristic set of every parenthesis (co) of order 0. As indicated above, we thus 
extend the definition of characteristic set to the case n = 0. Every charac- 
teristic set has one and the same null-segment 0. 

Lemma* I. For every (v,¢), {vy 
such thatm, > — 1, we have y, — (v — mz and also S (v — 8¢) me41. 

Proof. If v = 8,, the lemma is trivial. If v > s; (whence mz; exists), 
the lemma follows from le and 1b. If v < 8, (whence m, exists), let w be the 
uniquely determined integer satisfying the relations s,.1. =v <s,. Then 
1=wzfé. According to the part of the lemma already established, 


Te = (y My 
Moreover, 


m, t,, (1d). 
Hence 


¢ 
Yo Yon — ™,t, 


v=o+l 


v=o+1 
= (y 8) Me. 


Remark. If v > y, — Ye, <(¥- m, (and, in fact, < (v — 
for every m > mz41); and if »v < 8, y, — Ye (and, in fact, 
< (v — 8.) m for every m < mz). 


5. THE SUM OF TWO CHARACTERISTIC SETS 


Let A = (ao, and B = (bo, bi, ---, be two parentheses 
of ©, a, and £, being the rank of a, and b, respectively; furthermore, let 
ay + — © and By + — o. Then, according to the previous section, the 


characteristic sets 
Sa = {(m,t), (m,, t,)} 

* The occurrence of me (me,, ) in the lemma is not to be interpreted as implying that me (me,, ) 
exists; for it is our purpose, as shown in the formulation of the lemma, to include also the cases 
¢ = 0, when mp does not exist, and ¢ = 4, when Me does not exist. If m, (mg,1) does not 
exist, every relation rendered meaningless because of this non-existence is regarded as being 
fulfilled vacuously. Thus, if ¢ = 0, mo is non-existent and mp > — 1 is regarded as true. 
On the other hand, every relation that has meaning requires proof. In the light of these 
remarks, similar situations below—and a few above—are to be interpreted. 
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and 


Sp = )} 


exist. The sum S, + S, is defined (for r=1 and s 21) as a new set of 


number pairs 
Se {(m, ti), (ma, tr)}, 


the m’s and the ?#’s being defined in the following manner:* The sequence 
{m,, m2, +++, mp} is obtained by arranging the (single) set of numbers 
m, {v= 1,2,---,f} and m; =1,2,---,g} in a decreasing sequence 
according to magnitude and discarding duplicates. With each m associate 
the number ¢ = ¢), or t = ¢/ that corresponds to it in either S, or Sz if there 
is just one such correspondent; if there are two, t), in S, and ¢, in Sz, asso- 
ciate with m the number ¢ = ¢, + ¢;. Furthermore, S, + Sz is defined to 
be Sz or S, according as S, = 0 (which is the case when r = 0) or Sz = 0 
(which is the case when s = 0). 
From the definition of S, + S, it follows that to every segment 


So, {(m, ti), (mg, 
of where = 0,1, 2, ---, h, there correspond uniquely two segments 


Sa, ti), (mt, t;)} 
of S, and 

Sz, {( my ti), (m;, 
of Sz, such that Sa, +S, = Sc, . Two segments S,, and Sz _ thus deter- 
mined are said to be “ conjugate.” There are thus in all h + 1 pairs of con- 
jugate segments. 

Two conjugate segments S4 ‘ and Sg are characterized by the property that 
me > m,,, and m, > mey1. (The null-segments and the segments S4, = Su, 
S,, = Sz share this property on account of our agreement regarding vacuous 
validity.) 

It is evident that the h + 1 pairs of conjugate segments may be so ordered, 
—and in only one way —that £ = ¢’ and 7 =7’ (and furthermore, of course, 
either — < £’ or.» < n’), whenever the pair (Sa, , Sz, ) precedes the pair 
Sz,,). Then (S4,, is the first pair, and (S,,, = (S,, Ss), 
the last. 

We introduce here, with regard to conjugate segments, a lemma that will 
simplify the proof of the ‘‘ Theorem on the Product.’”’ This lemma gives us 
desirable information regarding the comparative values of the expressions 
—o and ae, + Bs” (see $2). Here s;, stand, of course, for 

v= 

v=l “y 

* The designation Sc is not meant to imply here the existence of a parenthesis C having S¢ 
as characteristic set. 
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Lemma II. Let A(ao + — ~) and B(By + — ~) be two parentheses 
of S; S4,, 8p, , two conjugate segments of S,, Sz; and X, mw, o integers such 
thtO0 =X Set 


5=A+yu+o— 
A=a,+8B, — (a, + 


and 


Then 

(a) if > and m = max (mp1, m,,1), A S bm; 

(6) if5 =0,m > —1,m, > —1land[d, yu, 0] #[H%,8,,0],A<0. 

Proof of (a). On account of our agreement regarding vacuous validity, 
there is nothing to prove if both m;;1 and m, +, are non-existent.* On account 
of symmetry, we may then assume without losing generality that m;,: exists 
and =m, 41. From Lemma I, we see that a, — a, S (A — 8¢) further- 
more, since S,, and S, are conjugate, m, > mz; = m,4; and therefore 
B, — Br )me+1; finally, since — 15 —o Ac- 
cordingly, 
8, +0) mes = im. 


Proof of (b). We suppose that m; and m, both exist, the requisite modi- 
fication of the argument being evident when this supposition is not fulfilled. 
Without loss of generality, we may assume, on account of symmetry, that 
m; =m,. We divide our discussion into two parts according as ¢ > 0 or 
¢=0. 

If ¢>0, —o <om,, since —1<m, by hypothesis. Furthermore, 
since S,, and Sz, are conjugate segments, m; = m,’ > m;41; hence (Lemma I) 


ay Os, = (Ar 8;) mM, . 
Since, finally, 8, — 6.” = (u — 8, )m, , we have 
A<(A-—s +0)m, =0. 


If ¢ = 0, it follows from [A, u,o] #[s;, 8,,0] and 6 =0 that either 
(1) — > O and — 8,’ < 0 or (2) A — 8; < O and 8, >0. In the 
first case, mz11 exists and (Lemma I, Remark) 


Ay Qs, = (A < (A . 


Since, moreover, 8, — = (u — it follows that 


* As indicated in a previous footnote (§ 4), it is not presupposed in Lemma II that me, 
Mei1, mM, ,m,,, exist. It is our purpose to include in the application of the lemma the cases 
€=0,7=0,&=f,n=g. If m;z,, exists,but m,,, does not, it is to be understood that 
max m, i= 41} Similarly, if exists and +1 does not. Since the relation 
A <0 has meaning in (b), even when ms or m,, or both are non-existent, it requires proof also 
in the latter cases. 
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=0. 


In the second case, 6, — Bs’ < (u — s,)m, (Lemma I, Remark). Since, 


moreover, 
— Oy, S — 8¢) me S (A — m, 


(on account of \ — s; < Oand m,’ = m,),A <0. 
n 


6. Tue “ THEOREM ON THE PRODUCT ”’* 


THeorEM. If A(ao+— ~) and B(Byo + — ~) are two parentheses 
of Sand C = A- B, then S, = 8, + Sz. 


Proof. C is a parenthesis of S by virtue of § 2. Moreover, 
(1) Yo = a + Bo. 


For, if in (b) § 2 we set vy = 0, it follows, on account of O=A,0Sy,08c, 
that X= 0, » = 0, o = O for every [A, uw, hence there is no 
[\, =[0,0,0]. The hypothesis of §2 is 
therefore fulfilled vacuously, and in consequence (1) holds. From (1) it 
follows that 7o is finite and accordingly that S, exists. 

The proof that S, = S, + S, is effected by means of mathematical induc- 
tion. We thus assume that it has already been established for the conjugate 
segments Say» Sz, of S,, S, that a segment Se, of S, exists such that 


(2) Sey = Sa, + Sz, 


Since Se, = S4, + Sz, (each term in this equation being the null-set), the 
theorem will be completely proved, if, on the basis of the assumption (2), 
we show that mz, and t,4; have values in accordance with the implications 
of S, = S,+8,. (Compare the ordering of the 4 + 1 pairs of conjugate 
segments, mentioned in § 5.) This is what we shall do. 

From (2) it follows, in accordance with the definition of the sum of two 
characteristic sets, that 
(3) =a +s,. 


There is nothing to prove if neither m4; nor m,.,; exists; for then, according 
to (3) and § 2, : 

&=8s +s =r+s=n, 
and therefore m,,; does not exist. Hence we may also assume, without loss 
of generality, that m; > — 1 and m, > —1. For suppose m; = —1 (a 
similar argument will hold if m, = — 1); then mg,; does not exist. More- 
over, on account of the conjugacy of S,, and Ss, . 


* This designation was suggested by Professor E. W. Davis. 


| 
| 
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, 


Hence m,',, does not exist (or else we may conclude that —1> —1).* 
From m; > — 1 and m, > — 1, it follows, however, by virtue of 1d, § 4, 
and the relations (1) and (2) of the present section, that 


(4) Yee = Oe, + Bs”. 


On account of symmetry, we may furthermore assume, without loss of gen- 
erality, that m4: exists and = mj,,,. We now divide our discussion (in 
natural manner) into the following three parts: 


(a) mi41 = —1. According to (a) Lemma II, if 
= 8, 
we have, since 6 = v — 8¢, ae, + Bs’ = Yee; and m = m1 = — 1: 


a + (v — &). 
Hence, by virtue of (a) § 2, y, S Ye — (v — 8) for every v > 82; i. €., 


Yr — Yee 
> 8} =-1. 
According to the definition of “ characteristic set,” this shows that m1 = — 1 
and that is, by (3), = or + according 
as m,,; is non-existent or existent (the value of m,',, being — 1 in the latter 
case). 
(b) > —1 and > (mj, being existent or non-existent). By 


means of (a) Lemma II, we show, as in the case m:41 = — 1, that 
Yee 
— & {y > = Me+1 


Moreover, the segments S,,_, and Sp, are conjugate; for, in addition to 
Mey > m,,1, we may conclude, on account of the assumed conjugacy of S,4, 
and S, and 1b, §4, that m, > mgy1 > mgy2 (the latter relation holding 
vacuously when m¢+2 does not exist). It therefore follows from (6) Lemma II 
and (b) § 2 that 


{v = S41 +8,} = + Be = Yee + = Yo, + 8¢) 
(1d, § 4 and (4) of present section), 
whence 
Ys 


¢ ” , 
= 8. = 
{v = S41 + 8,} = 


al i ms does not exist (as when = 0), the contradiction — 1 > — 1 cannot be obtained 
even if Ma exists; but in that case, me > — 1 holds vacuously. 


| 
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In view of the definition of “ characteristic set,’”” we may now conclude that 
= Mez1- If neither mgy2 nor m,,, exists, is evidently and there 
is nothing more to prove. We may therefore assume without losing generality 
that either or m,,; exists; hence m = max(me42, m,,1) exists. Ac- 
cording to (a) Lemma II, we then see, in virtue of the conjugacy of S,,,, and 
S,,, that fory =A + u+o > +8, , we have 


+ By — + By + (v — Si41 — 8, )m. 
But 
Be” = ae, + Be” + tey1 Mey = Ye, + 


(1d, § 4, since > — 1), 
by (4); furthermore, 


(v — — 8, )m < (v — 8441 — meq, 
since vy — 8:41 — 8, is positive and both m¢42 and m,,, are < mz41. Therefore 
a +6,-¢< + (7 8, ) = Ye, +(v- Me41, 
by (3). Consequently, in accordance with (a), § (2), 


Ye < Yap + (v — 
whence 
{v > 841 + 8, } < 
y= 8¢ 
We now see that 8.41 = 8:41 + 8, and hence tgy1 = tey41. 
(c) m1 = m,,; > — 1. As in the previous case, 


Ye ” 
{vy > 8 = & +8,} S 


moreover, by means of (b) Lemma II for the conjugate segments S,,,, and 
Sp, 41 and (b) § 2, it follows that 

Ye , , 
— & {vy = Sep + $y41} = Me+1; 
and finally, by means of (a) Lemma II for these segments and (a) § 2 (as in 
the previous case), that 

Vv Ys , 

{y > + < Me+1 + 
y= 

From these relations we conclude that = and that = thi 
thus completing the proof of the theorem. 


1916] FACTORIZATION OF EXPRESSIONS 533 


An immediate consequence of the ‘‘ Theorem on the Product ”’ is the 

“THEOREM ON Common Drvisors.” If the parentheses A(ao + — 
and B( By + — ©) are such that no m; {§ = 1, 2,---, f} is equal to an 
m, {n = 1,2, -++,g}, they have no common factors of order = 1. 

For every m of the characteristic set of a common factor is an m of both 
S4 and Sz. 

7. THE “ FACTORIZATION THEOREM” AND THE ‘“‘ 

IRREDUCIBILITY ” 


CRITERION FOR 


If the parenthesis C is such that y, {vy = 0,1, ---, m} is an integer, which 
is the case in nearly all of the applications, m; {f = 1,2, ---,h} is a rational 
number, being equal to (7; or — 1 by 1d, § 4. We then intro- 
duce the following 

Definition of +, and of d;. ‘The number 7; is the (positive) denominator of 
m; when m, is written in its lowest terms, and t; = d; r;. 

Suppose now that C is decomposed into the product A - B of two paren- 
theses, A being subject to the further restriction that a, {A = 0,1, ---, r} 
is an integer. (B may be treated as we are going to deal with A.) Let 7; 
and d; {§ = 1,2, +--+, f} be defined for A as 7, and d; were for C. From 
the “Theorem on the Product,” it follows that every m; is an m,; hence 
every 7; is a tz. The “Theorem on the Product” tells us furthermore, 
that if m; = mz, then t; St,; hence d; =d,;. Therefore, 


r= = d; = Te, 


where yz is an integer such that 0=»,=d,;. We have thus proved the 
following 

“ FacTorizATION THEOREM.” If C is such that y, {vy = 0,1, ---, is 
an integer (the value — ~ being allowed except, of course, for yo) , then the order 
of a factor A of C, where a, {X =0,1, likewise an integer (or — 
is of the form 


h 
Tr 
=1 


where v, is an integer such that 0 S vz S d; (and r,, d; are defined as above). 

In particular, the ‘‘ Factorization Theorem ” leads to the following 

“ CRITERION FOR InREDUCcIBILITY.” If the ranks of the elements of S are 
all integers (or — ©), and (yn — Yo)/n is a fraction in its lowest terms > — 1 
and also > (y, — Yo)/v {vy = 1,2, +--+, — 1}, C ts irreducible. 

For in this case, the characteristic set of C consists of just one pair 


(m1, = — Yo/n, n) 
Trans. Am, Math, Soc, 35 
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and moreover, since (Yn — Yo)/m is in its lowest terms, 7; = 4) = n and 
d, = 1. Hence, the order of a factor A of C is, according to the “ Fac- 
torization Theorem,” of the form 


or n. 


The difficulty of obtaining for T_,; a more exhaustive criterion than the 
“ Criterion for Irreducibility ”’ may be judged from the following 

Remarks. There cannot be a theorem for T_, that asserts the irreducibility 
of even a single parenthesis having more than one element in its characteristic 
set, unless the theorem asserts nothing for at least one of the situations of § 3. 
For according to a result of Dumas,* based on more general considerations of 
Hensel,{ every parenthesis of Situation 2 having two or more elements in its 
characteristic set is reducible; that is, in Situation 2, C is reducible unless either 


— 1 < — Yo)/n = (W — Yo)/v {v=1,2,---,n—1} 


Sc = {(m,t:)} = {(—1, n)}. 


Furthermore, the “ Criterion for Irreducibility ” becomes false, if the con- 
dition that (7 — Yo)/n be a fraction in its lowest terms is not demanded. 
In fact, for every n and y, — Yo that are not relatively prime, there is a C 
belonging to Situation 1, for instance, that satisfies every other condition of 
our criterion and is nevertheless reducible, as the following example will show: 
Let n = kd and Yn — Yo = Id, k, 1, d being integers such that d > 1 (and 
1=0). The parenthesis C = y*¢ — p~ of Situation 1 has the factor 
y* —p', although yo =0, = — ©, Yo = — ©, Yn = — 
Ts * ld, (Yn — Yo)/n (Id/kd) > = vo)/v {v =1,2,---,n— 1}. 
Finally, our criterion becomes false if the condition (yn — yo)/n > — 1 is 
made milder to the extent of replacing 1 by a number e > 1, however small 
¢—1 may be. (If (Yn — yo)/n = —1, nm must be 1 on account of the 
relative primeness of n and y, — Yo, and our criterion is trivial.) This is 
shown by the following example in Situation 6:{ Let 


or 


d"y dy 
te y. 
Here = 0, @, °**s —- Fat = 
Yn = Yo)/n = 2, 
*L. c., p. 223. 


t Neue Grundlagen der Arithmetik, Journal fiir Mathematik, vol. 127 (1904), 
pp. 51-84, Section 4. 

t It will appear in the next section that no such example is possible for Situations 1, 2, 
3, 4, 5, 8, or 9 of §3, since these situations (as will appear) are instances of the Theory 
T-«.. As a consequence, the “ Criterion for Irreducibility’” holds for these situations if 
(yn — yo)/n > — 1 is replaced by (yn — yo)/n > — ©. 


= 
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-,m— 1}. However small « — 1 may be, n may be so chosen that 


n 


Still, C = A - B, where 
A=dy/dx and B= d™" y/dx** +a" y. 


Clearly the “ Criterion for Irreducibility ” is not improved by replacing 
(Yn — Yo)/n > (¥, — Yo)/v by (Yn — Yo)/n = (Y — Yo)/v; for the former 
condition is a consequence of the latter and the condition that (yn — yo)/n is 
a fraction in its lowest terms. 


8. THE ABSTRACT THEORY T, 


In his 1900 Mathematische Annalen paper, Ueber die Irre- 
ductibilitét algebraischer Functionalgleichungen, Koenigsberger deals with a 
situation, which, if slightly modified to conform to our point of view, does not 
fall under the theory T_,. This situation, which we designate in accordance 
with our previous numbering as Situation 10, is essentially* as follows: 

Situation 10. © consists of the collectivity of fractions e = e’/e’’, where 
e’ and e” are power series in x — 0 (= 2%, ef. Situation 5) with arbitrary 
complex coefficients, being constant with reference tox. As in Situation 5, 
the rank of e(+ 0) is defined as d’ — d’, where d’ and d” represent the 
lowest degrees (in the usual sense) in x — @ = 2, of a non-zero term of eé’ 
and e”’ respectively; the rank of 0 is — ©. The parenthesis (é, ¢1, «++, 
is the linear homogeneous differential expression 


d™ 


The product of two parentheses is defined as in Situation 6. 

If we compare Situation 10 with the illustrative example of §1 (that is, 
with Situation 6), we see it to be no longer true, as it was there, that the 
rank of de(2)/dz is always less than or equal to the rank of e(2); the rank 
of x — 0, for example, is — 1, while the rank of d(2 — @)/dris 0. It is 
true, however, as may be verified in elementary fashion, that the rank of 
de (x)/dx is at most 1 more than the rank of e(z). This new state of affairs’ 
requires a corresponding modification of the abstract theory T_,. A natural 
modification leads to a theory, called—with propriety in the light of the defini- 
tion of T, below— 

Tue Apstract THEeory Modify the Theory by replacing in (a) 
and (b), § 2, the expression a, + B, —o by a, +B, +0, leaving everything 
else unaltered; the resulting theory is T41. 


* Situation 10 is slightly more general than Koenigsberger’s situation. 


-. 
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The reader may easily verify that Situation 10 is an instance of the Theory 

Again, we proceed to show how another theory, properly called T_,, 
naturally arises. It is easy to see that most of the situations of § 3 satisfy 
more restrictive conditions than (a) and (b) of §2. In fact, the reader may 
verify that Situations 1, 2, 3, 4, 5, 8, and 9 actually satisfy the following 
conditions: 

(a) = max 

+u=v 


every [A, #[A, 4], 
It is evident, if we agree that — © - 0 = 0, that these new conditions (a) 
and (b) are equivalent to the following: 
(a) = max 
+uto=v 


(6) 
= v} for every [A, wu, #[A, 4,0], {(OSAS7r,0S 458,050}. 

We thus arrive at 

Tue Asstract Toeory T_,. Modify the Theory T_, by replacing in (a) 
and (b), §2, the expression a, + B, —o by a, +8, — © +o and agree that 
— © +0 = 0; the resulting theory 1s T_.. 

Situations 1-5 incl., 8 and 9 are instances of the Theory T_... 

The considerations above lead to the idea of 

Tue Asstract THEory T, FOR EVERY REAL NUMBER € (AND FOR€ = — ©). 
This is obtained from T_, by replacing in (a) and (b), § 2, the expression 


by +e. 


It is evident that the Theory T, is a T. fore’ >. 

In the Theory &,, “ characteristic set,” the “sum of two characteristic 
sets,” “ conjugate segments ” and the numbers 7, and d; of § 7 may be defined 
similarly as in the Theory T_,. 

Thus, the “ characteristic set” of C in the Theory Z, is defined as follows: 


Mei = 


Fo = Va 
= last v> 8¢ such that = Me415 af Me+1 and 


So41 = nm, Uf = 


It is readily seen that the relations 1, § 4, hold in Z, if the last relation of (b) 
is changed to m, = € and in (d) m; > — 1 18 changed to mz > €. Likewise, 
Lemma I holds if m; > — 1 is replaced by m; > €, and the proof is precisely 
analogous. 


te F 
max ( 
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The “ sum of two characteristic sets’ and “ conjugate segments” are defined 
in , precisely as in §5. Lemma II holds in &, if A is set equal to 


a, + B, (a, + Bs’), 


and in (b) m > —1, m,; > — 1 are replaced by m; >€, m, >, and the 
proof is precisely analogous. 

From these considerations, it is evident that 

The “‘ Theorem on the Product” and the “ Theorem on Common Divisors” 
hold for E,, and the proof is entirely analogous to that of § 6. 

In trying to carry over the results of § 7 to the Theory T,, we at first note 
a slight divergence; for even if the parenthesis C is such that y, {vy = 0,1, 
-++, mn} is an integer, m, need not be rational if equal toe. The divergence 
disappears, however, and in natural fashion, if we define 7; to be equal to + 1 
in case mz = €, whether ¢ is rational or not.* In other cases (m,; > €), Tz 8 
defined as in §7 and d, (as there) always by the equation t; = d;r;. It then 
appears that 

The Factorization Theorem” holds in and 

The “ Criterion for Irreducibility ” holds in &, if the relation 


— Yo)/n > —1 
is replaced by the relation 
(Yn — Yo)/n 


The remarks at the close of §7 show the difficulty of obtaining a more 
exhaustive criterion for irreducibility in fT, . 


APPENDIX. CONNECTIONS WITH THE LITERATURE; CONCRETE EXAMPLES 


(A) The Schoenemann-Eisenstein theorem says that 


where the 7’s are integers and 2, not divisible by the prime integer 7, is irre- 
ducible in the domain of rational numbers. This theorem is a special case 
of the “‘ Criterion for Irreducibility ” in the Theory T_,, (or in T, fore < — }), 
when interpreted for Situation 1, §3. For, C = (co, ¢1, +++,¢n), where 
= In D; hence Yo =0, 1, 
Y-1 = —1,% = —1. Therefore, 


* The reader will remember that in case m; = e, we have only an inequality 
( Yee = me te) 
for Ys g instead of the usual equality. 
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(¥n — Yo)/n = > — Yo)/v {vy =1,2,---,n—1} 


and is in its lowest terms.* 

The irreducibility of y? ++ y7*+--- +y+1 (p =a prime integer) 
in the domain of rational numbers is a direct consequence of the Schoenemann- 
Eisenstein theorem. 

(B) The Floquet Theorem{ says that the linear homogeneous differential 
expression 


C = (x 0)" po(x) + (2 — py (2) 


d 
+ (a — (a) + 
is irreducible under the following conditions: 


1. C is said to be irreducible, if it is not expressible as a product of two 
linear homogeneous differential expressions of order = 1, whose coefficients 
are power series in x — @ with arbitrary complex numerical coefficients. 

Mi, Mes °** Mn—1 are all = 1. 

3. The p’s are power series in  — @ such that po(0), p1(0), «++, pn(@) 
are all + 0. 

The theorem is a special case of the “ Criterion for Irreducibility ” in the 
Theory T,,, when interpreted for Situation 10, §8. For, we have 


Yo= —(n+1), m= —(n-1+4m), 


= — (1 + = 0. 
Hence 


(vy, — Yo)/v {y=1,2,---,n—1} 


< (n+1)/n = — Yo)/n, 
a fraction in its lowest terms. 

(C) Koenigsberger, Journal fiir Mathematik, vol. 115, pp. 
53-78. Every result of this paper (pertaining exclusively to the reducibility 
of expressions) falls under our theory. We select several examples.$ 

(a) P. 55... “‘ Wenn in einer algebraischen Gleichung 


C = + (e)y™* + +en(z) = 0, 


* By noting that C is irreducible when and only when (cy, ¢n-1, ++, Co) is irreducible— 
this is, of course, not always so in the abstract theory—we may show that the Schoenemann- 
Eisenstein theorem is a consequence of the “ Criterion for Irreducibility ” in a T.,«=0, 
interpreted for Situation 1, Section 3. A similar remark applies to other examples of the 
appendix. 

t Eisenstein, ce. 

t Contained in Koenigsberger, Mathematische Annalen, vol. 53 (1900), p. 67 

§ In the quoted theorems of the appendix, slight deviations from the original are made 
when greater harmony in statement with the rest of this paper may be thereby secured. 


> 
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in welcher ¢o(x), +++, ganze Functionen* von bedeuten, 
alle Coefficienten mit Ausnahme des ersten durch einen Linearfactor x — 0, 
der letzte aber nicht durch (x — @)* theilbar ist, so ist die Gleichung irre- 
ductibel.” 

This theoremf is a special case of the “ Criterion for Irreducibility,” T_.. , 
Situation 5, § 3, & being taken equal to 1, and 2; set equal tox — 06. It is 
seen that yo = 0,71 S —1, 72 S —1,°°+, %-1 S — 1, Yn = 1, a8 in (A). 

(b) P.63. THurorem. Every polynomial of the form 


C = po(z)y" + — py (x) + — pa (2) 


is irreducible under the following conditions: 

1. The p’s are polynomials in xz, and the domain of rationality is the set 
of rational functions of x with arbitrary complex coefficients; p9(@) + 0 and 
+ 0. 

2. 7 is a positive integer relatively prime to n; E(t) signifies, as usual, the 
largest integer St. 

The theorem is a special case of the “ Criterion for Irreducibility ” in T_,, 
Situation 5,k = 1,2, =2-—-—6. For, if y, represents the rank of the coef- 
ficient of we have yo = 0,91 S — E(i/n) S — E(2t/n) -1, 

Yer — E{(n—lt/n)—1, —1; (Yn — Yo) = —t/n, 
fraction in its lowest terms and furthermore (Yn — Yo)/n > (¥, — Yo)/v 
{y=1,2,-+-,n—l1}. 

(c) P. 77. “ Jede algebraische Gleichung n-ten Grades von der Form 


(a (1) 2) (2) 


in welcher n = uw - v, uw, und p relative Primzahlen, p und q zwei verschiedene 
Primzahlen, ip, 71, «++, in beliebige ganze Zahlen, von denen nur i und % 
weder durch p noch durch q theilbar sind, 

= E(p—l1/u)+1, = E(p—1/r) +1, 


also x = v, kK = yp ist, endlich das Symbol (p/o) die Einheit oder Null 
bedeutet, je nachdem p durch o theilbar oder nicht theilbar, ist irreductibel.” 

The theorem follows from the “ Factorization Theorem,” T_,,, Situation 1. 
For, let y, be the rank of the coefficient of y"~” with respect to the prime p; 


* By “ganze Functionen,”’ Koenigsberger means polynomials; the theorem is also true, 
however, when the c’s are power series in  — 0. 

+ The domain of rationality is not given explicitly in the statement of the theorem; however, 
the subsequent statement that the theorem comes under Situation 5 supplies the deficiency. 
A similar remark applies to other examples of the appendix. 


~ 
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y,, the rank of the same coefficient with respect to the prime g. We then 
have: 


yw=0, —K? —(2/p), 


(¥n — Yo)/n = 1}. 


The characteristic set (with respect to p) consists of the single element 
(m,t:) =(—v/n,n); =p and Hence, according to the 
“ Factorization Theorem,” the order r of a factor of C is of the form r = iy, 
where 7 is an integer. Similarly, by reasoning with the ranks +, of the coef- 
ficients with respect to g, we conclude that r = jy, where j is an integer. 
Since yw and » are relatively prime, r must be of the form xuv = xn. Hence, 
r = 0 or n; that is, C is irreducible. 

Example. 

+PPiytp =0, 

io and % being divisible by neither p nor q. 

It is evident that a more general theorem may be at once obtained by 
taking & primes (k arbitrary) instead of the two primes p and q. 

(D) Nettoop Mathematische Annalen, vol. 48, pp. 81-88. 
All the results may be treated from our point of view. We select several 


examples. 
(a) Pp. 82-83. THEeorem. The polynomial 


yr + ay py bain py™ pr 


where p is a prime, i, {v = 1,2, +++, 2u + 1} an integer and ton, not divisible 
by p, ws irreducible. 

The theorem is a special case of the “ Criterion for Irreducibility,” T_.., 
Situation 1. 

(b) P. 85. ‘‘ Wenn im Polynome 


alle Coefficienten c, durch p theilbar sind, c,_; aber durch keine héhere Potenz 
von p als die erste, dann ist es entweder irreductibel oder es zerfallt in der 
Weise, dass es einen Factor des Grades 1 und einen anderen irreductiblen 
Factor des Grades n — 1 besitzt.” 

The theorem is a special case of the “ Factorization Theorem,” T_,, 
Situation 1. For 


yo = 0, n=a-l, Yn Ya 1; 


== - 
= 
eee 
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— Yo)/n S (%n-1 — Yo)/(n — 1) > (% — Y0)/¥ 
1}. 


It is hence evident that the first pair (m1, t,) of the characteristic set of C is 
either (— 1/(m — 1), m — 1)—and then #, of the second pair equals 1—or 
(—1/n, n). Therefore, according to the “ Factorization Theorem,” the 
order of a factor of C is of the form i(m — 1) + 7 or in, where 7 and j have 
the values 0 or 1. The theorem is thus proved. 

(E) Perron, Mathematische Annalen, vol. 60, pp. 448-458. 
Every result falls under our theory. We select several examples. 

(a) P. 452. “ Die algebraische Gleichung 


WO 21, «++, % beliebige ganze rationale Zahlen bedeuten, deren letzte zu den 
Primzahlen p;, po, +++, pe prim ist, wo ferner die k +1 Zahlen n, e1, e2, 
-++,é, keinen gemeinsamen Teiler haben, ist irreduzibel.” (The e’s are 
> 0.) 

The theorem is a special case of the “ Factorization Theorem,” T_,, 
Situation 1. For, let y,,, {vy =0,1, ---,; o =1, 2, «++, k} be the 
rank of the coefficient of y*~” with respect to the prime p,. Then 


= 0, = — — — (2¢,/n) — 02,6; 
Yr-1,¢ = — ((n —1)e,/n) — On-1, 6 The 
where the @’s lie between 0 (exclusive) and 1 (inclusive); 


(Yn, ¢ — Y0,¢)/n = (e,/n) > (tne — {vy = 1,2, — 1}. 


It then follows from the “ Factorization Theorem,” the ranks of the coef- 
ficients being taken with respect to p, , that the characteristic set consists of 
the single element (— e,/n,n). Hence, the order r of a factor of C has the 
form (in/[n, e,]), where 7 is an integer and [n, e,] = G.C.D. of n and e,. 
Therefore r[n, e,] is a multiple of n foro = 1,2,---,k. Since the G.C.D. 
of the k numbers [n, e,] is 1 by hypothesis, there exist & integers 7, such that 

Hencer = e,] is a multiple of n; that is, 
r=OQorn. 


(b) Pp. 455-456. “ Das Polynom 


oder allgemeiner 


541 
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wo % und 7, durch die Primzahl p nicht teilbar sind, und wo e zu n — k relativ 
prim ist, enthalt mindestens einen irreduzibeln Faktor, dessen Grad = n — k 

A special case of the “ Factorization Theorem,” T_,,, Situation 1. 

(F) Koenigsberger, Mathematische Annalen, vol. 53, pp. 
49-80. Every result (within our present field of investigation) is a special 
case of the “ Factorization Theorem ” either in T,,; for Situation 10 or in 
T_.. for Situation 5,4 = 1,2; = We select several illustrations. 

(a) P. 69. “ Die lineare Differentialgleichung 


d” 
C = (x 0)" po(z) + (2 — pi (2) 


d 
+ (x — (x) + = 0, 


in welcher ***, = 2 sind, ist, wenn die Ordnung n derselben un- 

gerade, stets irreductibel; ist die Ordnung jedoch gerade, so kann sie nur mit 

einer gleichartigen linearer Differentialgleichung n/2 ter Ordnung die simmt- 

lichen Integrale der letzteren gemein haben.” (Here the p’s are power series 

in (« — @) such that po(@), ---, pn(@) are all + 0; cf. (B), appendix.) 
This theorem falls in T,,, Situation 10. We have 


Yo= —(n+2), Yn = 0; 


(¥n — Yo)/m = (n+2)/n > (v +2 = (w — Yo)/¥ 
fy 1,2, 


Hence, according to the “ Criterion for Irreducibility,” C is irreducible if n 
is odd; and according to the “ Factorization Theorem,” if n is even, the order 
of a factor of C is of the form in/2, where 7 is an integer. This result is 
equivalent to the statement of the theorem. 

(b) P. 73. ‘ Eine Differentialgleichung von der Form 


d” 


d 
+ py (2) = 0, 


in welcher py, Sind, kann nur mit einer gleichartigen 
Differentialgleichung, deren Ordnung = n/yp ist, alle Integrale der letzteren 
gemein haben.” (Here the p’s are as in (a).) 
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A special case of the “ Factorization Theorem,” T,,, Situation 10. 
(c) P. 77. “ Die lineare Differentialgleichung 


d” 
av d 
+ (2 — py (2) OP (a) 


+ pn(x)y =0, 


in welcher 71, = 1 und p eine beliebige positive ganze Zahl 
bedeutet, ist irreductibel.” (Here the p’s are as in (a).) 

A consequence of the “ Criterion for Irreducibility,” T 1, Situation 10. 

(G) Koenigsberger, Journal fiir Mathematik, vol. 121, pp. 
320-359. The leading results on factorization contained in this paper—in 
particular, the (complicated but general) result p. 342—fall under. the “ Fac- 
torization Theorem,” T_,., either for Situation 1 or for Situation 5, k = 1. 

(H) Perron’s Criterion, Journal fiir Mathematik, vol. 132, 
p. 304. 

“Wenn sich unter den respektiven Gradzahlen 1, 2, «++, Yn der Funk- 
tionen c; (a), ¢2(2), +++, n(x) eine findet (y;), welche den Ungleichungen 


(2) 
(8) 
geniigt, wo v jede von 7 verschiedene der Zahlen 1, 2, ---, m bedeutet; und 


wenn ausserdem 7; relativ prim ist zu 7, so ist 
C=f(z,y) + + 


irreduzibel. Sind aber die Ungleichungen (a) nur insoweit erfiillt, dass fiir 
gewisse Werte von v das Gleichheiszeichen an Stelle des Grésserzeichens tritt, 
so ist fiir die Irreduzibilitét ausserdem noch erforderlich, dass keine Konstante 
c existiert, fiir die f (2, c) als Funktion von z identisch verschwindet.” (The 
c’s are here polynomials in x with arbitrary complex coefficients.) 

This theorem is a consequence of the “ Theorem on the Product,” T_., 
Situation 3, k = 1, 21 = 2. For, from (8) it follows, since yo = 0, that 
the first element (7m, t,) of the characteristic set of Cis (y:/i,1). Ift=n, 
it follows from the “ Criterion for Irreducibility,” since y; and 7 are relatively 
prime, that C is irreducible. If i < n, the characteristic set of C has other 
elements (m,;,t;), where § > 1, but on account of (a), m,; must be < 0. 
Now, according to the “ Theorem on the Product ” [cf. proof of the “ Fac- 
torization Theorem ”], it follows in virtue of the relative primeness of ¥; 


= 


544 HENRY BLUMBERG 


and i, that only one of the factors, say A, of C= A- B(r21ands21) 
can have m; = 7;/i as an m of its characteristic set; and every m of the char- 
acteristic set of B is thus <0. According to a theorem of Gauss, we may 
assume, without loss of generality, that the coefficients of the various powers 
of y in B are polynomials in x and hence of rank =0. Since fo ( = rank of 
the coefficient of the highest power of y in B) is then obviously 0, at least 
one m of the characteristic set of Bis =0. This contradiction shows that C 
is irreducible. If, on the other hand, the inequality (a) is changed to y; = v,, 
we have no longer a contradiction, but can merely conclude that every m of 
the characteristic set of B is 0. Hence the rank of every coefficient of the 
powers of y in B is 0; that is, these coefficients are constants. Therefore, B 
depends on y alone, and there exists a value c for y, namely a root for y of 
B=0,such that C =f(z,c)=A-B=0. 

(1) Dumas, Journal de Mathématiques, ser. 6, vol. 2, 
pp. 191-258. The theorem p. 217 is equivalent to the ‘ Theorem on the 
Product ”; the theorem p. 237, to the “ Factorization Theorem ”; and the 
theorem p. 242, to the “ Theorem on Common Divisors,” all for T_,, Situ- 
ation 2. 

(J) Bauer, Journal fiir Mathematik (see previous citations). 
The principal results come under the results of Dumas. 
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THE FORMAL MODULAR INVARIANT THEORY OF BINARY 
QUANTICS* 
BY 


O. E. GLENN 


The group, G, represented by the general binary linear transformation 
in which the coefficients are parameters representing residues of the prime 
number p, has the universal covariantst 


a fundamental system being given by L = I,, Q = I,/I,. Assume that 
fm = (do, 1, a1, 


is a binary quantic of order m whose coefficients are variables. We study the 
functions of the a’s and z’s which are invariantive under the operation of 
transforming fn by G. Sections 1, 2, 3 deal with methods of constructing 
such functions, especial attention being given to the case p= 2. In §5 
fundamental systems of first degree concomitants of the quartic and quintic 
are derived. Section 4 is devoted to a proof that if the systems of concomitants 
modulo 2 of the forms of orders 1, 2, 3 and the simultaneous systems obtained 
by combining forms of these three orders are all finite, then the system of f,, is 
likewise finite. The hitherto undemonstrated theorem on the finiteness of 
the formal concomitants modulo 2 is thus reduced to a comparatively simple 
problem. In §6 there is proved a theorem on the reducibility of any covari- 
ant, modulo 2, of the binary cubic in terms of a set of fourteen invariants 
and covariants. 


1. INVARIANT OPERATORS 


In addition to modular polars and transvectants previously discussedft 
by the present writer we cite, in order to augment the number of construction 


* Presented to the Society, February 26, 1916. 
+ Dickson, these Transactions, vol. 12 (1911), p. 75; and Madison Colloquium 
Lectures, 1913, p. 33. 
tO. E. Glenn, American Journal of Mathematics, vol. 37 (1915), p. 73; 
and Bulletin of the American Mathematical Society, vol. 21 (1915), 
p. 167. 
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methods, the universal operators obtained by replacing, in LZ and Q, the 
variables 21, 22 by the cogredient elements 0/0z2, — 0/dx,. Call these 
operators respectively L; and Q;; 


If Ky is any formal covariant modulo p of degree-order (7, M) and if 


(1) (p+1)r+p(p—1)s+w=M, 
then 

is a formal covariant of degree-order (1, w). The number of concomitants 
yielded by this formula equals the number of distinct solutions in positive 
integers (r,s, w) of the linear diophantine equation (1). The lists given 
below are for p = 2, Ky = fm, each giving all forms C,,.. for the corresponding 
m. Corresponding to each invariant C,.9 there exists an invariant operator 
>> (8/da) obtained by constructing the Aronhold operator for the two forms 
L’ Q, fm. These are also given in the particular cases shown. 


m=4 
0 
= G1 + a2 + G3; Dac 3a, * 
= 41 + 22, = + 23. 
m= 5 


0 
Cro = + + + da, * da,’ 
= dz + 3 22, = a2 xi + a3 Cos = QC 
m= 6 


0 0 re] 0 


Cin = (ar + a3) a1 + (a3 + a5) a2, Cox = a3 Q, Cros = a3 L, 


4 2 2 t 


C200 Co30 = a3; 


m=7 


6 
= Gy + + a3 + + + 


04;’ 
Coo1 = (a2 + a4) + (3 + 


= (ao'+"a1 + a3 + ds + + (a1 + + + + G7) 22, 
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Cire = (a1 + ag) + (a3 + a6) 23, 
Coos = (do + + a4) + (a1 + a3 + a5) + (G2 + + 23 


+ (a3 + a5 + a7) 23, 
Cros = (G1 + a2) at + (a5 + ag) 22, 


Corus = (do + + a2) + (a1 + ae + a3) + (a4 + + 
+ (as + a + a7) 22. 


2. COVARIANTS LED BY ASSIGNED SEMINVARIANTS 


If we add to the variables ao, a1, +--+; 21,2 in any formal covariant 
o(do, 21, of the increments of these variables when f,, is trans- 
formed by 21 = 2, 2% = tz; + 22, and then expand (a + dao, -++) by 
Taylor’s theorem, we find that any formal covariant modulo p has an anni- 
hilator of the type* 


9 


0 re) 
= Oo+ + x} 


In this theory ¢ is any residue modulo p and the expansion in question con- 
tains only the p terms obtained by reducing all powers of t below the pth by 
Fermat’s theorem. The operator 0; (j = 0,1, ---) is a partial differential 
operator in the derivatives with respect to the coefficients of f,,, non-homo- 
geneous as to the derivatives the orders of which range from zero to infinity 
in each O;. 

If we apply T to a covariant and proceed as in the well-known proof of 
Roberts’ theorem on the unique determination of an algebraical covariant 
from its seminvariant leader we find that the resulting relations in the co- 
variant’s coefficients and the operators 0; (7 = 0,1, ---) are not recurrent 
in the formal theory, whereas they are recurrent in the algebraic theory. 
Thus a formal covariant is not uniquely determined from its leader. This 
is also evident from the fact that if 7 is any such covariant of order w + 0 
(mod p), then 


is also a formal covariant having the same seminvariant leader. 

Under definite conditions we can, however, derive a covariant 7' of order 
p — 1 of fn having a given seminvariant S as leader. This is done by sub- 
stituting for ao, a1, d2, «++ in S the derivatives 
~ *Cf. Dickson, these Transactions, vol. 8 (1907), p. 209. An explicit 2 (6), 


analogous to Oo, is given in my paperin American Journal of Mathematics, 
loc. cit., p. 75. 


0 
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Mm Ox m(m—1) 
regarding 21, 22 as integers modulo p and rearranging the result as a poly- 
nomial in z;, x2 of order p— 1. This is analogous to the process of Faa di 
Bruno for algebraic covariants. The same result is reached by expanding 
S(a),a,, +++), corresponding to the transformation of fn by 

2, te +22, 
replacing ¢ by x2/z, and multiplying by z?. The result is 


Oo 8 


This is a formal covariant only provided it satisfies the differential congruence 
TT = 0 (mod p), and is symmetrical under the substitution 


= (Go Om) (G1 Gm—1) (122). 


Imposing these conditions we deduce the conditions which S must satisfy in 
order that it may lead the covariant of order p — 1. These are, (a) that S 
should be transformed into 0{! S/ p — 1 by s, and (6) that the following 
congruences should be satisfied: 


038 =0 
+ 0,0,7)S = 0, 
(3) + 01 OF? + 02 S =0, 


(05 + 0: OF + 0205 + + =0, 
(Oo + 01 Oo + 02 + + 0,1 087) S =0. 


The theory of 7 is thus complicated. It is given here because nothing has 
been published hitherto on the determination of a covariant with a given 
leader, and also for the reason that it is of much practical use in constructing 
covariants with assigned properties in special cases. Let p=2, m= 3. 
A seminvariant of f; is S = aga, + aj. Transformation of f; by 21 = 21, 
Xo = tx; + 22 gives 


S’ = [ao + (a1 + a2 + a3)] (a2 + tas) + (a; + tas)? 
= dy dz + aj + (ao a3 + a de + a3 + a3) t (mod 2). 


Rendering S symmetrical with the coefficient of ¢ (condition (a)) by adding 
the invariant do a3 + a, a2 = A, (annihilated by O)) we have the formal 
linear covariant* 


Ill 


*Cf. American Journal of Mathematics, loc. cit., p. 78. 


— 
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CP = (S + + (A + ai as + a3) ae. 


For p = 3, m = 2, S = aga} — a) a, the method leads easily to the quad- 
ratic covariant of fe, 


Ce = + (ao a2 — a) a2) + (a1 — aj a2) 23. 


3. SOME COVARIANTS OF f, MODULO 2 

For the modulus 2 the real points (21, 22), other than (0,0), in the 
plane are (1,1), (0,1), (1,0). The values of f,, at these points are, 
respectively, 

Qo + a1 + Om >» ao. 

By a theorem* of Dickson’s, any symmetric function of these three expressions 
is a formal invariant, modulo 2, all such being, of course, rational in the three 
elementary symmetric functions 


I= (do + Gn) ant K), 


(4) 
k = ao (do + K + am) Gm. 


In the same way if 


Cu = Ag xi + Ai ai + 
is any formal covariant of f,,, modulo 2, any symmetric function of 

+ Ayn, Ao, Ax 
is an invariant of the original form f,. Thus the middle coefficient of any 
quadratic covariant of f,, is an invariant, the sum of the two middle coefficients 
of any cubic covariant is an invariant, and so on. 


By means of the transformations on the a’s induced by 2 = 2 + tz, 
= 22, Viz. 


(r=0,-++, m), 
it is readily shown that 


(51) Ke = agai + Kay 22 + Gm 22, Ki = (ao + K) ai + (K + Gm) 22, 


are formal covariants modulo 2. 
Lemma. For any odd order m > 1 (p = 2) the form fm has a cubic covariant 
Kms with leading coefficient ay, which is such that 


Kms = Ke (mod 2) 


for integral values of 21, x2. 


* These Transactions, vol. 15 (1914), p. 497. 
Trans, Am. Math, Soc, 36 


— 
— 
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The covariants K,»3 for the first twelve odd orders m are 
K33 = dot} + + + a3 22, 
= + (a, + a2) + (a3 + a4) 21 22 + 22, 
= + (a, + a2 + a4) + (3 + as + Og) + 22, 
= do + (ay + + + + (5 + + + Og) + Oy 22, 
Kurs = do ti + (a1 + a2 + ag + a5 + Og) 
+ (a3 + + + dy + 21 23 + 22. 


| 


| 


The general form to be proved is 


3 2 2 3 
Kms = + xj + + Gn 22, 
where 


I, = a;, + +++ +4;,, Ip = aj, + +++ s=43(m-—1), 
=K. 


Assuming the increments of J,, Iz under (5) to be I, t, I, t we readily show 


that 
I, =I, = a (mod 2). 


Hence, letting J; = a; a; + +++ + Qm—1Gm—1 (a; = 0 or 1), we have from 
> 721 a, a,, with (5), the conclusion that the selection of the set ai,, ai,, 


-++,@;, is accomplished by selecting a set o, of columns from the following 
Pascal triangle (6), the set to have the properties (a), (b) below: 


(3) (m™1): 
a; a2 a3 An-2 Qm-1- 


(a) The sum of the numbers of the first row of the selected set o; is odd. 

(b) The sum for all of the other rows of a; is even. 

Grant for the moment that such a set can be selected. If its columns be 
deleted from (6) the set a2 of columns remaining in (6) will also have the 
properties (a), (b) since Pascal’s triangle is symmetrical with respect to the 
median drawn from This second set o2 gives a;,, a;,, @;, ,1.¢., Ie. 

We now construct the triangle ABC made up of the residues modulo 2 of 
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Pascal’s triangle, placing (7) at the upper vertex, with the aforesaid median 
drawn vertically. We note particularly the symmetrical properties, and in 
particular the regular recurrence of the inverted triangles of zeros, of increasing 
dimensions, having the median as a line of symmetry. Consider any ele- 
ment e of any horizontal row, to the left of the median. We call the column 
parallel to AB and containing e the column of e. The element e’ in the same 
row as ¢ but occupying the complementary position to the right of the median 
is called the complementary element of e. The column parallel to AB con- 
taining e’ is the complementary row of ¢, and the column containing e and 
parallel to BC is the complementary column of e. We have e’ = e, and 
the complementary row and complementary column of e identical. 


B 
1°1 
0 , 
1 
1 1/1/1/1/1 
0 070 0 
1 1 
1/0/0/0/0 0 1 0 
0 1f/0/0 0 1 0 0 0 
0.007 07 07 0 6. 8 
Gy Gs A, a; a, 


Suppose we wish to select J, i. e., the set a1, form = 19. We take the 
left-hand half-row number 18 from B as hypotenuse and draw the triangle 
enclosing the columns of these elements. The elements on the hypotenuse 
correspond to set 

T G1, G2, G4, 5, As, G7, Ag, Ay. 


If the rows enclosed within this triangle satisfy conditions (a), (b) (as is the 
case for m = 17) then o; would be the set 7; and J; equal to the sum of the 
$(m —1)a’s in their natural order beginning with a,. The sums of the 
rows above the first in the triangle are however 1,1,0,1,0,1,0,1 (mod 2) 
respectively, instead of 0, 0, 0, 0, 0, 0, 0, 0 (mod 2) required by condi- 
tion (b). To secure a triangle (augmented) which does satisfy (b) we replace 
some of the elements on the hypotenuse by their complementary elements, 
in this case e3, és (corresponding to a3, as) by e3, es (corresponding to aig, a4). 


— 
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The columns of e3, ¢; are thus replaced by their complementary columns by 
rotating the latter counter-clockwise around e;, ¢;. ‘The augmented triangle 
now satisfies conditions (a), (b) and hence 


Ty = + + aig + + + + + Og + 


Now the orders m for which the triangles do not need to be augmented are 
those giving the first row below the base of each central triangle of zeros. 
These values of m are 


(7) 3, 5, 9, 17, 33, ---, 2 +1, --- 


and 2" — 1 is the number of the row, counting downward from B, in which 
the base of the nth central triangle of zeros is found. For m = 2% +1, J; 
equals the sum of the first 2”~! a’s in their natural order, J; = a; + a2 +--+. 
Between any two consecutive cases of (7) there is a cycle of augmented tri- 
angles such that the whole configuration from B downward forms a sequence 
of recurring cycles of figures proceeding according to the laws of symmetry 
of the triangle ABC. Thus the set o; can always be selected and the lemma 
is true. 

We now observe that any covariant Cy of f, of odd order M has corre- 
sponding to it a quadratic and a cubic covariant of f,, constructed as co- 
variants of Cy, on the models of Kz, Kms respectively. (Cf. § 6.) 


4. THE FINITENESS OF THE FORMAL CONCOMITANTS FOR THE ORDER ™ AND 
MODULUS 2 
THEOREM. The general quantic fn, m > 3, is reducible, modulo 2, in terms 
of Q, L, and its own covariants of the first degree and orders 1, 2,3. 
Let m be even, m = 2k. Then f, — Q*" Kz has the factor 22. Hence 
it has the factor L, since the real points (mod 2) form a conjugate set under 
the group G. Therefore 


(8) fm = Q**K,+LC (mod 2), 


where C is a first degree covariant of f, of odd order 2k — 3. Next let m 
be odd, m = 2h+3>3. Then 


(9) fm = Knz + LC’ (mod 2), 


where C’ is of even order 2h. 

The forms C, C’ are, in turn, reducible in terms of their own covariants of 
orders 2, 3 with LZ and Q, and, as a covariant of C is a covariant of f,, we are 
led to a recurring process by which f,, is in all cases expressed in terms of its 
covariants of orders 1, 2,3, and L and Q, which was to be proved. 

Any concomitant of a polynomial in a set of concomitants is a function of 


_ 
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concomitants of the forms in the set. Hence if the systems for the orders 
1, 2, 3, and the simultaneous systems modulo 2 for these three orders are 
finite, the system for f, is likewise finite (see § 6). 


5. SYSTEMS OF THE FIRST DEGREE 


While illustrating formulas (8), (9), we now derive fundamental systems 
of first degree concomitants for the quartic and the quintic forms. 

An independent set of linear seminvariants of f, is a, a1, @2 +43. The 
only linear invariant is Cox9 = a; + a2 + a3 (= K, m = 4 (§$3)). Multi- 
plication of K; of (5;) and Cio: of § 1 by powers of Q gives covariants of 
any odd order led by a1, a9 + K, and, by subtracting such covariants, co- 
variants led by ay + a2 + ag. 

Lemma. There exists no covariant of odd order led by K. 


Assume such a covariant in the form C = Kaj"*' + Izj" xz. +--+. Then 
from (5), m = 4, 
K (aj'*" + +++) +--+) =C (mod 2), 
where I becomes I’ under (5). Hence, writing I’ = I + th, 
(mod 2). 


Now if I contains a,, the increment J; contains a), whereas K does not. 
Hence I does not contain a,. But the increment to a linear function of 
ao, 41, G2, a3 contains a, only, whereas K contains ag. Hence K # I;, a 
contradiction. 

It now follows that the general form of a covariant of odd order is 


C = [dai + + ae + a3) + 
C= (A+ Q Cin + Ki + IT (mod 2), 


where I is of even order 2k — 2. 
Any covariant of even order may be written 


Cy = [Ado + mai + v (a2 ai" + 
and we have, using K2 (m = 4) of § 3, and Coy: of § 1, 
C, = + (u + 7) Cow] + + LT, (mod 2); 


T’; being a first degree covariant of odd order 2h — 3. Hence the fundamental 


system sought is 
(10) K, Ki, Cow, L, Q. 


The form f;, itself is reducible as follows [cf. (8)]: 
fa = QKo + L( Ki + Ci) (mod 2). 


Thus 


— 
z= 
| 
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Employing K;, Kz, K, Kms (m = 5) of §3 and Coo1, Cioz of § 1, we find 
for the required fundamental system of the quintic, 


(11) K, Ki, Kss, Con C02 L, Q. 
The reduction of the form f; itself is given by 
fo = QKss +L (K2+Cim)  (mod2) [ef. (9)]. 


6. ON THE COMPLETE SYSTEM OF THE CUBIC, MODULO 2 


The development of the asyzygetic theory of the concomitants of a form 
often precedes the derivation of its complete system. In this section we 
show that every covariant of order > 3 of a binary cubic, modulo 2, is quasi- 
reducible (i. e., reducible on multiplication by K* (s = 0)) in terms of a set 
of fourteen concomitants, nine covariants and five invariants. 

It is known that the fundamental system of seminvariants* of f; modulo 2 is 


12) do, K=a+ = (do + K + a3) az, 
A=aa;t+aa, B=ai+aoqa, 
and also that f; has the invariants 
A, I =a, +a) K + bo, k = ao doo, 
V =B(B+ +a) K)(A + 


In two previous papers (quoted above) I have shown that f; has the covariantst 


(13) 


H = sxj + Ax, r2 + (a; a3 + a3) 23, 


G, = (A+s8)2, + (A +a) 43 + a3) 22 (8 =aom+a?), 
P = + a) + a3 2123 + a3 22, 
(14) Ki = (a9 + K)a, + (K + 43) 22, 


Kz = do + Kx, 22 + a3 22, 
(aj + K?) a, + (K? + 43) 


Co = xi + K2 21 22 + a3 23. 


ll 


There is, therefore, a covariant led by 8, viz., 


t= KK, + H = Bri + (A + K*) a1 22 + (a2 a3 + a3) 23. 


* Dickson, Madison Colloquium Lectures, p. 53. 
tCf. American Journal of Mathematics, loc. cit., p. 78. The forms 
co, ce in Table I are reducible as follows: 


CD =LKi+QK:, CY =Kt+(4+ K*) K:(mod2). 
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None of the invariants A, g (below), J can be leading coefficients of covariants 
of odd order, but there exists a cubic covariant led by the invariant K, viz., 


G = QKi + fs = + (ao + a1 + 3) (ao + ae + a3) 21 23 + Ka}. 


Taking the elementary symmetric function of second degree in the coefficients 
of t, as explained in § 3 (4), we have the fourth degree invariant of f;, 


(15) +6 +6i(A + K*) + Bi (61 = a2 a3 + a3), 
= 6? ++ B(A + K*) + (A + b00) (8 + ao K + K’) (mod 2). 
Now from (13), (15) we have 
(16) + B(a) + aK +1 + K’) 
+ (a, +aK+I+A)(a K+ K*)+9=0. 


Any seminvariant ¢ of f3, being a polynomial in the seminvariants (12), is a 
polynomial in ay, K,6,A, I; 


@ = ¢(a,8,K,A,1). 


We now use congruences (16) as reducing moduli, whereupon we are able to 
reduce all exponents* of 8 (in ¢) below 2 and all exponents of ap below 3. 
Thus any seminvariant can be reduced to the form 


=Jot Jiao +J2a5 + (To 


wherein J;, '; (¢ = 0, 1, 2) are invariants (expressed in terms of K, A, k, 
I, g) some of which might be zero. Let C,, be any covariant of f; of even 
order M = 2h, led by ¢; Cy = ozi" +--+. Then, 


Cy = QO Jo + "(Kodi + C2 J2) 
+ iT) + (mod2), 


where C is a covariant of odd order 2h — 3. Thus every covariant of even 
order > 3 is reducible in terms of our covariants and invariants. 
Proceeding to covariants of odd order; there is a covariant with semin- 


variant leader B = 6 + A, viz., 
L = Bai + (do a2 + ai + a1 a3 + a3) 


+ (do + a1 + + aj) + (A + ae ag + 03) 23 


+ Kfs. 


+ Cf. L. J. Reed, “Some fundamental systems of formal modular invariants and covariants,’’ 
Dissertation, University of Pennsylvania, 1915, § 3. 


é 
. 
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Replacing 8 by B + A in (16) and in ¢ we can reduce any seminvariant to 


the form 
= Ro + Ri ao + Reap + (So + Si ao + Sz a5) B, 


where R;, S; (¢ = 0,1, 2) are polynomials in the five invariants K, A, k, 
I,g. Let Cy be a covariant of f3 of odd order N = 2h + 3 > 3, led by 9; 
Cy = +--+. Then 
KC Ro G KR, fs KR, Q* P 
+ K (IS, Q* + IS; Kz, Q* + IS, C, + LC’ (mod 2). 

We have now proved the following:* 

THEOREM: Every formal covariant modulo 2 of order > 3 of the binary cubic 
fs is quasi-reducible, upon multiplication by K* (s = 0), in terms of the fourteen 
concomitants listed below: 


A, k, fs; Ko, C2, K,, G4, t, L, Q. 


“*On the subject of quasi-reducibility in ternariant theory, cf. Forsyth, American 
Journal of Mathematics, vol. 12 (1890). 
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